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Preface 


The purpose of this book is to empower the reader with a magnificent new 
perspective on a wide range of fundamental topics in mathematics. Tensor calculus 
is a language with a unique ability to express mathematical ideas with utmost utility, 
transparency, and elegance. It can help students from all technical fields see their 
respective fields in a new and exciting way. If calculus and linear algebra are central 
to the reader’s scientific endeavors, tensor calculus is indispensable. This particular 
textbook is meant for advanced undergraduate and graduate audiences. It envisions 
a time when tensor calculus, once championed by Einstein, is once again a common 
language among scientists. 

A plethora of older textbooks exist on the subject. This book is distinguished 
from its peers by the thoroughness with which the underlying essential elements 
are treated. It focuses a great deal on the geometric fundamentals, the mechanics of 
change of variables, the proper use of the tensor notation, and the interplay between 
algebra and geometry. The early chapters have many words and few equations. The 
definition of a tensor comes only in Chap. 6 — when the reader is ready for it. 

Part III of this book is devoted to the calculus of moving surfaces (CMS). One 
of the central applications of tensor calculus is differential geometry, and there is 
probably not one book about tensors in which a major portion is not devoted to 
manifolds. The CMS extends tensor calculus to moving manifolds. Applications 
of the CMS are extraordinarily broad. The CMS extends the language of tensors 
to physical problems with moving interfaces. It is an effective tool for analyzing 
boundary variations of partial differential equations. It also enables us to bring the 
calculus of variations within the tensor framework. 

While this book maintains a reasonable level of rigor, it takes great care to avoid 
a formalization of the subject. Topological spaces and groups are not mentioned. 
Instead, this book focuses on concrete objects and appeals to the reader’s geometric 
intuition with respect to such fundamental concepts as the Euclidean space, surface, 
length, area, and volume. A few other books do a good job in this regard, including 
[2, 8, 31, 46]. The book [42] is particularly concise and offers the shortest path to 
the general relativity theory. Of course, for those interested in relativity, Hermann 
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Weyl’s classic Space, Time, Matter [47] is without a rival. For an excellent book 
with an emphasis on elasticity, see [40]. 

Along with eschewing formalism, this book also strives to avoid vagueness 
associated with such notions as the infinitesimal differentials dx l . While a number 
of fundamental concepts are accepted without definition, all subsequent elements of 
the calculus are derived in a consistent and rigorous way. 

The description of Euclidean spaces centers on the basis vectors Z, . These 
important and geometrically intuitive objects are absent from many textbooks. Yet, 
their use greatly simplifies the introduction of a number of concepts, including 
the metric tensor Zy = Z,- ■ Z j and Christoffel symbol T l - k = 21 ■ 3Z y /3Z A '. 
Furthermore, the use of vector quantities goes a long way towards helping the 
student see the world in a way that is independent of Cartesian coordinates. 

The notation is of paramount importance in mastering the subject. To borrow 
a sentence from A.J. McConnell [31]: “The notation of the tensor calculus is so 
much an integral part of the calculus that once the student has become accustomed 
to its peculiarities he will have gone a long way towards solving the difficulties 
of the theory itself.” The introduction of the tensor technique is woven into the 
presentation of the material in Chap. 4. As a result, the framework is described in a 
natural context that makes the effectiveness of the rules and conventions apparent. 
This is unlike most other textbooks which introduce the tensor notation in advance 
of the actual content. 

In spirit and vision, this book is most similar to A.J. McConnell’s classic 
Applications of Tensor Calculus [31]. His concrete no-frills approach is perfect for 
the subject and served as an inspiration for this book’s style. Tullio Levi-Civita’s 
own The Absolute Differential Calculus [28] is an indispensable source that reveals 
the motivations of the subject’s co-founder. 

Since a heavy emphasis in placed on vector-valued quantities, it is important to 
have good familiarity with geometric vectors viewed as objects on their own terms 
rather than elements in M" . A number of textbooks discuss the geometric nature of 
vectors in great depth. First and foremost is J.W. Gibbs’ classic [14], which served as 
a prototype for later texts. Danielson [8] also gives a good introduction to geometric 
vectors and offers an excellent discussion on the subject of differentiation of vector 
fields. 

The following books enjoy a good reputation in the modern differential geometry 
community: [3,6,23,29,32,41]. Other popular textbooks, including [38,43] are 
known for taking the formal approach to the subject. 

Virtually all books on the subject focus on applications, with differential 
geometry front and center. Other common applications include analytical dynamics, 
continuum mechanics, and relativity theory. Some books focus on particular appli- 
cations. A case in point is L.V. Bewley’s Tensor Analysis of Electric Circuits And 
Machines [1], Bewley envisioned that the tensor approach to electrical engineering 
would become a standard. Here is hoping his dream eventually comes true. 


Philadelphia, PA 


Pavel Grinfeld 
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Chapter 1 

Why Tensor Calculus? 


“Mathematics is a language.” Thus was the response of the great American scientist 
J. Willard Gibbs when asked at a Yale faculty meeting whether mathematics should 
really be as important a part of the undergraduate curriculum as classical languages. 

Tensor calculus is a specific language within the general language of mathemat- 
ics. It is used to express the concepts of multivariable calculus and its applications in 
disciplines as diverse as linear algebra, differential geometry, calculus of variations, 
continuum mechanics, and, perhaps tensors’ most popular application, general 
relativity. Albert Einstein was an early proponent of tensor analysis and made 
a valuable contribution to the subject in the form of the Einstein summation 
convention. Furthermore, he lent the newly invented technique much clout and 
contributed greatly to its rapid adoption. In a letter to Tullio Levi-Civita, a co- 
inventor of tensor calculus, Einstein expressed his admiration for the subject in the 
following words: “1 admire the elegance of your method of computation; it must be 
nice to ride through these fields upon the horse of true mathematics while the like 
of us have to make our way laboriously on foot.” 

Tensor calculus is not the only language for multivariable calculus and its 
applications. A popular alternative to tensors is the so-called modern language of 
differential geometry. Both languages aim at a geometric description independent 
of coordinate systems. Yet, the two languages are quite different and each offers its 
own set of relative strengths and weaknesses. 

Two of the greatest geometers of the twentieth century, Elie Cartan and Hermann 
Weyl, advised against the extremes of both approaches. In his classic Riemannian 
Geometry in an Orthogonal Frame [5] (see also, [4]), Cartan recommended to“as 
far as possible avoid very formal computations in which an orgy of tensor indices 
hides a geometric picture which is often very simple.” In response, Weyl (my 
personal scientific hero) saw it necessary to caution against the excessive adherence 
to the coordinate free approach [47]: “In trying to avoid continual reference to the 
components, we are obliged to adopt an endless profusion of names and symbols in 
addition to an intricate set of rules for carrying out calculations, so that the balance 
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Why Tensor Calculus? 


of advantage is considerably on the negative side. An emphatic protest must be 
entered against these orgies of formalism which are threatening the peace of even 
the technical scientist.” 

It is important to master both languages and to be aware of their relative strengths 
and weaknesses. The ultimate choice of which language to use must be dictated 
by the particular problem at hand. This book attempts to heed the advice of both 
Cartan and Weyl and to present a clear geometric picture along with an effective 
and elegant analytical technique that is tensor calculus. It is a by-product of the 
historical trends on what is in fashion that tensor calculus has presently lost much 
of its initial popularity. Perhaps this book will help this magnificent subject to make 
a comeback. 

Tensor calculus seeks to take full advantage of the robustness of coordinate 
systems without falling subject to the artifacts of a particular coordinate system. 
The power of tensor calculus comes from this grand compromise: it approaches 
the world by introducing a coordinate system at the very start — however, it never 
specifies which coordinate system and never relies on any special features of the co- 
ordinate system. In adopting this philosophy, tensor calculus finds its golden mean. 

Finding this golden mean is the primary achievement of tensor calculus. Also 
worthy of mention are some of the secondary benefits of tensor calculus: 

A. The tensor notation, even detached from the powerful concept of a tensor, 
can often help systematize a calculation, particularly if differentiation is involved. 
The tensor notation is incredibly compact, especially with the help of the Einstein 
summation convention. Yet, despite its compactness, the notation is utterly robust 
and surprisingly explicit. It hides nothing, suggests correct moves, and translates to 
step-by-step recipes for calculation. 

B. The concept of a tensor arises when one aims to preserve the geometric 
perspective and still take advantage of coordinate systems. A tensor is an encoded 
geometric object in a particular coordinate system. It is to be decoded at the right 
time: when the algebraic analysis is completed and we are ready for the answer. 
From this approach comes the true power of tensor calculus: it combines, with 
extraordinary success, the best of both geometric and algebraic worlds. 

C. Tensor calculus is algorithmic. That is, tensor calculus expressions, such as 
A,V“ Z' a for mean curvature of a surface, can be systematically translated into 
the lower-level combinations of familiar calculus operations. As a result, it is 
straightforward to implement tensor calculus symbolically. Various implementa- 
tions are available in the most popular computer algebra systems and as stand-alone 
packages. 

As you can see, the answer to the question Why Tensor Calculus? is quite 
multifaceted. The following four motivating examples continue answering this 
question in more specific terms. If at least one of these examples resonates with 
the reader and compels him or her to continue reading this textbook, then these 
examples will have accomplished their goal. 
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Motivating Example 1: The Gradient 


What is the gradient of a function F and a point PI You are familiar with two 
definitions, one geometric and one analytical. According to the geometric definition, 
the gradient V F of F is the vector that points in the direction of the greatest increase 
of the function F , and its magnitude equals the greatest rate of increase. According 
to the analytical definition that requires the presence of a coordinate system, the 
gradient of F is the triplet of numbers 


VF = 



(1.1) 


Are the two definitions equivalent in some sense? If you believe that the 
connection is 


„ dF . dF . 

VF “ aT + a7 J ’ 


(1.2) 


where i and j are the coordinate basis, you are in for a surprise! Equation (1.2) can 
only be considered valid if it produces the same vector in all coordinate systems. You 
may not be familiar with the definition of a coordinate basis in general curvilinear 
coordinates, such as spherical coordinates. The appropriate definition will be given 
in Chap. 5. However, equation (1.2) yields different answers even for the two 
coordinate systems in Fig. 1.1. 

For a more specific example, consider a temperature distribution T in a two- 
dimensional rectangular room. Refer the interior of the room to a rectangular 
coordinate system x, y where the coordinate lines are one meter apart. This 
coordinate system is illustrated on the left of Fig. 1.1. Express the temperature field 
in terms of these coordinates and construct the vector gradient VT according to 
equation (1.2). 







j'' 





i' 


































j 









‘i 





























Fig. 1.1 When the expression in equation (1.2) is evaluated in these two coordinate systems, the 
results are not the same. 
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Alternatively, refer the interior of the room to another rectangular system x' , y', 
illustrated on the right of Fig. 1.1), whose coordinate lines are two meters apart. For 
example, at a point where x = 2, the new coordinate x' equals 1. Therefore, the 
new coordinates and the old coordinates are related by the identities 

x = lx 1 y = 2 y'. (1.3) 


Now repeat the construction of the gradient according to equation (1.2) in the new 
coordinate system: refer the temperature field to the new coordinates, resulting in 
the function F' (x', y'), calculate the partial derivatives and evaluate the expression 
in equation (1.2), except with “primed” elements: 


dF' dF' 

,vr) = a? l ' + V J '- 


(1.4) 


How does VT compare to (VT/? The magnitudes of the new coordinate vectors 
i' and j' are double those of the old coordinate vectors i and j. What happens to 
the partial derivatives? Do they halve (this would be good) or do they double (this 
would be trouble)? 

They double! This is because in the new coordinates, quantities change twice 
as fast. In evaluating the rate of change with respect to, say, x, one increments 
x by a small amount Ax, such as Ax = 10 -3 , and determines how much the 
function F (x, y) changes in response to that small change in x. When one evaluates 
the partial derivative with respect to x' in the new coordinate system, the same 
increment in the new variable x’ is, in physical terms, twice as large. It results in 
twice as large a change AF' in the function F'(x',y'). Therefore, AF' / Ax' is 
approximately twice as large as AF / Ax and we conclude that partial derivatives 
double: 


BF’(x’,y') _ JF(x,y) 

dx' 3x ' ( ' 

Thus, the relationship between (VT)' and VT reads (VT)' = 4 VT and the 
two results are different. Therefore, equation (1.2) cannot be used as the analytical 
definition of the gradient because it yields different results in different coordinate 
systems. 

Tensor calculus offers a solution to this problem. Indeed, one of the central goals 
of tensor calculus is to construct expressions that evaluate to the same result in all 
coordinate systems. The fix to the gradient can be found in Sect. 6.8 of Chap. 6. 

Exercise 1. Suppose that the temperature field T is given by the function 
F (x,y) = x 2 e y in coordinates x,y. Determine the function F (x',y'), which 
gives the temperature field T in coordinates x',y'. 

Exercise 2. Confirm equation (1.5) for the functions F(x,v) and F'(x',y') 
derived in the preceding exercise. 
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Exercise 3. Show that the expression 


vr 


1 dF 1 dF 

T 7 ! dx + VY1 d y 


( 1 . 6 ) 


yields the same result for all rescalings of Cartesian coordinates. 

Exercise 4. Show that equation (1.2) yields the same expression in all Cartesian 
coordinates. The key to this exercise is the fact that any two Cartesian coordinate 
systems x , y and x', y' are related by the equation 


X 

— 

a 

+ 

COS O' 

— sin o' 



y . 


_b_ 


_ sin o' 

COS O' 




What is the physical interpretation of the numbers a, b, and al 

Exercise 5. Conclude that equation ( 1 .6) is valid for all orthogonal affine coordi- 
nate systems. Affine coordinates are those with straight coordinate lines. 


Motivating Example 2: The Determinant 

In your linear algebra course, you studied determinants and you certainly learned 
the almost miraculous property that the determinant of a product is a product of 
determinants: 


\AB\ = |*| . 


( 1 - 8 ) 


This relationship can be explained by the deep geometric interpretation of the 
determinant as the signed volume of a parallelepiped. Some textbooks — in particular 
the textbook by Peter Lax [27] — take this approach. Other notable textbooks (e.g., 
Gilbert Strang’s), derive (1.8) by row operations. Yet other classics, including Paul 
Halmos’s [22] and Israel M. Gelfand’s [13], prove identity (1.8) by analyzing the 
complicated algebraic definition of the determinant. 

In tensor notation, the argument found in Halmos and Gelfand can be presented 
as an elegant and beautiful calculation. This calculation can be found in Chap. 9 and 
fits in a single line 


C 


— X ijk r' r s r t - 
2 j °rst c i c j c k 


y^ersMa^a^^ABe,,, 


-AB, Q.E.D. 


(1-9) 


You will soon learn to carry out such chains with complete command. You will 
find that, in tensor notation, many previously complicated calculations become quite 
natural and simple. 
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Motivating Example 3: Vector Calculus 

Hardly anyone can remember the key identities from vector calculus, let alone derive 
them. Do you remember this one? 


A x (V x B) = V B (A • B) - (A • V) B? 


( 1 . 10 ) 


(And, more importantly, do you recall how to interpret the expressions V B (A • B) 
and (A • V) B?) I will admit that I do not have equation (1.10), either. However, 
when one knows tensor calculus, one need not memorize identities such as (1.10). 
Rather, one is able to derive and interpret them on the fly: 


e rsi A s e ijk W j B k = (s r jS s k - A s V j B k = A S VB S - A, V s B ' . (1.11) 


In equation (1.1 1) we see Cartan’s orgy of formalism of equation (1.10) replaced by 
Weyl’s orgy of indices. In this particular case, the benefits of the tensor approach 
are evident. 


Motivating Example 4: Shape Optimization 

The problem of finding a surface with the least surface area that encloses the domain 
of a given volume, the answer being a sphere, is one of the oldest problems in 
mathematics. It is now considered to be a classical problem of the calculus of 
variations. Yet, most textbooks on the calculus of variations deal only with the two- 
dimensional variant of this problem, often referred to as the Dido Problem or the 
problem of Queen Dido, which is finding a curve of least arc length that incloses a 
domain of a given area. The full three-dimensional problem is usually deemed to be 
too complex, while the treatment of the two-dimensional problem often takes two 
or more pages. 

The calculus of a moving surface — an extension of tensor calculus to deforming 
manifolds, to which Part III of this textbook is devoted — solves this problem 
directly, naturally, and concisely. The following derivation is found in Chap. 17 
where all the necessary details are given. Here, we give a general outline merely to 
showcase the conciseness and the elegance of the analysis. 

The modified objective function E incorporating the isoperimetric constraint for 
the surface S enclosing the domain Q reads 



( 1 . 12 ) 


where A is a Lagrange multiplier. The variation SE with respect to variations C in 
shape is given by 
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8E = [ C (~B“ + X)dS, (1.13) 

Js 

where B“ is mean curvature. Since C can be treated as an independent variation, 
the equilibrium equation states that the optimal shape has constant mean curvature 

B“ = k. (1.14) 


Naturally, a sphere satisfies this equation. 

This derivation is one of the most vivid illustrations of the power of tensor cal- 
culus and the calculus of moving surfaces. It shows how much can be accomplished 
with the help of the grand compromise of tensor calculus, including preservation of 
geometric insight while enabling robust analysis. 



Part I 

Tensors in Euclidean Spaces 



Chapter 2 

Rules of the Game 


2.1 Preview 

According to the great German mathematician David Hilbert,“mathematics is a 
game played according to certain simple rules with meaningless marks on paper.” 
The goal of this chapter is to lay down the simple rules by which the game of tensor 
calculus is played. 

We take a relatively informal approach to the foundations of our subject. We 
do not mention JR", groups, isomorphisms, homeomorphisms and polylinear forms. 
Instead, we strive to build a common context by appealing to concepts that we 
find intuitively clear. Our goal is to establish an understanding of Euclidean spaces 
and of the fundamental operations that take place in a Euclidean space — most 
notably, operations on vectors including differentiation of vectors with respect to 
a parameter. 


2.2 The Euclidean Space 

All objects discussed in this book live in a Euclidean space. 

What is a Euclidean space? There is a formal definition, but for us an informal 
definition will do: a Euclidean space corresponds to the physical space of our 
everyday experience. There are many features of this space that we take for granted. 
The foremost of these is its ability to accommodate straight-edged objects. Take a 
look at Rodin’s “The Thinker” ( Le Penseur) in Fig. 2.1. Of course, the sculpture 
itself does not have a single straight line in it. But the gallery in which it is housed 
is a perfect rectangular box. 

Our physical space is called Euclidean after the ancient Greek mathematician 
Euclid of Alexandria who lived in the fourth century BC. He is the author of 
the Elements [11], a book in which he formulated and developed the subject of 
geometry. The Elements reigned as the supreme textbook on geometry for an 
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Fig. 2.1 Rodin’s “Thinker” finds himself in a Euclidean space characterized by the possibility of 
straightness 



Fig. 2.2 A page from The Elements 


astoundingly long period of time: since its creation and until the middle of the 
twentieth century. The Elements set the course for modern science and earned Euclid 
a reputation as the “Father of Geometry”. 

Euclid’s geometry is based on the study of straight lines and planes. Figure 2.2 
shows a page from one of the oldest (circa 100AD) surviving copies of The 
Elements. That page contains a drawing of a square and an adjacent rectangle. Such 
a drawing may well be found in a contemporary geometry textbook. The focus is on 
straight lines. 




2.3 Length, Area, and Volume 
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Not all spaces have the ability to accommodate straight lines. Only the straight 
line, the plane, and the full three-dimensional space are Euclidean. The rest, such 
as the surface of a sphere, are not. Some curved spaces display only certain features 
of Euclidean spaces. For example, the surface of a cylinder, while non-Euclidean, 
can be cut with a pair of scissors and unwrapped into a flat Euclidean space. The 
same can be said of the surface of a cone. We know, on an intuitive level, that 
these are essentially flat surfaces arranged, with no distortions, in the ambient three- 
dimensional space. We have not given the term “essentially flat” a precise meaning, 
but we will do so in Chap. 12. 


2.3 Length, Area, and Volume 

Are you willing to accept the concept of a Euclidean space without a formal 
definition? If so, you should similarly accept two additional geometric concepts: 
length of a segment and angle between two segments. 

The concept of area can be built up from the concept of length. The area of a 
rectangle is the product ab of the lengths of its sides. A right triangle is half of a 
rectangle, so its area is the product ab of the lengths of its sides adjacent to the right 
angle. The area of an arbitrary triangle can be calculated by dividing it into two right 
triangles. The area of a polygon can be calculated by dividing it into triangles. 

The concept of volume is similarly built up from the concept of length. The 
volume of a rectangular box is the product abc of the lengths of its sides. The 
volume of a tetrahedron is Ah/ 3 where h is the height and A is the area of the 
opposite base. Interestingly, the geometric proof of this formula is not simple. The 
volume of a polyhedron can be calculated by dividing it intro tetrahedra. 

Our intuitive understanding of lengths, areas, and volumes also extends to curved 
geometries. We understand the meaning of the surface area of a sphere or a cylinder 
or any other curved surface. That is not to say that the formalization of the notion 
of area is completely straightforward. Even for the cylinder, an attempt to define 
area as the limit of surface areas of inscribed polyhedra was met with fundamental 
difficulties. An elementary example [39,48], in which the inscribed areas approach 
infinity, was put forth by the German mathematician Hermann Schwarz. 

However, difficulties in formalizing the concepts of surface area and volume 
did not prevent mathematicians from to calculating those quantities effectively. An 
early and fundamental breakthrough came from Archimedes who demonstrated that 
the volume of a sphere is | jtR 3 and its surface area is 4 jtR 2 . The final word on 
the subject of quadrature (i.e., computing areas and volumes) came nearly 2,000 
years later in the form of Isaac Newton's calculus. We reiterate that difficulties in 
formalizing the concepts of length, area, and volume and difficulties in obtaining 
analytical expressions for lengths, areas, and volumes should not prevent us from 
discussing and using these concepts. 

In our description of space, length is a primary concept. That is, the concept of 
length comes first and other concepts are defined in terms of it. For example, below 
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we define the dot product between two vectors as the product of their lengths and 
the cosine of the angle between them. Thus, the dot product a secondary concept 
defined in terms of length. Meanwhile, length is not defined in terms of anything 
else — it is accepted without a definition. 


2.4 Scalars and Vectors 

Scalar fields and vector fields are ubiquitous in our description of natural phenom- 
ena. The term field refers to a collection of scalars or vectors defined at each point 
of a Euclidean space or subdomain Q. A scalar is a real number. Examples of 
scalar fields include temperature, mass density, and pressure. A vector is a directed 
segment. Vectors are denoted by boldface letters such as V and R. Examples of 
vector fields include gravitational and electromagnetic forces, fluid flow velocity, 
and the vector gradient of a scalar field. This book introduces a new meaningful 
type of field — tensor fields — when the Euclidean space is referred to a coordinate 
system. 


2.5 The Dot Product 

The dot product is an operation of fantastic utility. For two vectors U and V, the dot 
product U • V is defined as the product of their lengths |U| and | V| and the cosine of 
the angle between them: 


U- V = |U| |V| cos a. 


(2.1) 


This is the only sense in which the dot product is used in this book. The great utility 
of the dot product comes from the fact that most, if not all, geometric properties can 
be expressed in terms of the dot product. 

The dot product has a number of fundamental properties. It is commutative 


U- V = YU 


( 2 . 2 ) 


which is clear from the definition (2.1). It is also linear in each argument. With 
respect to the first argument, linearity means 


(Ui + U 2 ) ■ V = Ui • V + U 2 • V. 


(2.3) 


This property is also known as distributivity. Unlike commutativity, which imme- 
diately follows from the definition (2.1), distributivity is not at all obvious and the 
reader is invited to demonstrate it geometrically. 


2.6 The Directional Derivative 


15 


Exercise 6. Demonstrate geometrically that the dot product is distributive. 

As we mentioned above, most geometric quantities can be expressed in terms of 
the dot product. The length |U| of a vector U is given by 

|U| = Vu-U. (2.4) 

Similarly, the angle a between two vectors U and V, is given by 


u • v 

a = arccos . (2.5) 

VtTuVv^v 

The definition (2.1) of the dot product relies on the concept of the angle between 
two vectors. Therefore, angle may be considered a primary concept. It turns out, 
however, that the concept of angle can be derived from the concept of length. The 
dot product U • Y can be defined in terms of lengths alone, without a reference to 
the angle a: 


IU + VI 2 - IU-VI 2 

U-V=- -. (2.6) 

Thus, the concept of angle is not needed for the definition of the dot product. Instead, 
equation (2.5) can be viewed as the definition of the angle between U and V. 


2.5.1 Inner Products and Lengths in Linear Algebra 

In our approach, the concept of length comes first and the dot product is built upon 
it. In linear algebra, the dot product is often referred to as the inner product and 
the relationship is reversed: the inner product (U, V) is any operation that satisfies 
three properties (distributivity, symmetry, and positive definiteness), and length |U| 
is defined as the square root of the inner product of U with itself 

|U| = 7(U,U). (2.7) 

Thus, in linear algebra, the inner product is a primary concept and length is a 
secondary concept. 


2.6 The Directional Derivative 

A directional derivative measures the rate of change of a scalar field F along a 
straight line. Let a straight ray / emanate from the point P . Suppose that P* is 
a nearby point on / and let P* approach P in the sense that the distance PP* 
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approaches zero. Then the directional derivative dF / dl at the point P is defined as 
the limit 


dF (P ) F(P*)-F(P) 

— = lim 

dl p*-*p PP* 


(2.8) 


Instead of a ray, we could consider an entire straight line, but we must still pick a 
direction along the line. Additionally, we must let the distance PP* be signed. That 
is, we agree that PP* is positive if the direction from P to P* is the same as the 
chosen direction of the line /, and negative otherwise. 

The beauty of the directional derivative is that it is an entirely geometric concept. 
Its definition requires two elements: straight lines and length. Both are present in 
Euclidean spaces, making definition (2.8) possible. Importantly, the definition of a 
directional derivative does not require a coordinate system. 

The definition of the direction derivative can be given in terms of the ordinary 
derivative of a scalar function. This can help avoid using limits by hiding the concept 
of the limit at a lower level. Parameterize the points P * along the straight line 1 by 
the signed distance s from P to P* . Then the values of function of s in the sense 
of ordinary calculus. Let us denote that function by / ( s ), using a lower case / to 
distinguish it from F, the scalar field defined in the Euclidean space. Then, as it is 
easy to see 


dF(P) 

dl 


f ( 0 ) . 


(2.9) 


where the derivative of / (s) is evaluated in the sense of ordinary calculus. 
Definition (2.9) assumes that the parameterization is such that s = 0 at P . More 
generally, if the point P is located at s = sq , then the definition of dF /dl is 


dF(P) 

dl 


f (so) • 


(2.10) 


The following exercises are meant reinforce the point that directional derivatives 
can be evaluated without referring the Euclidean space to a coordinate system. 

Exercise 7. Evaluate dF /dl for F ( P ) = “Distance from point P to point A” in 
a direction perpendicular to AP . 

Exercise 8. Evaluate dF/dl for F ( P ) = “l/(Distance from point P to point A)" 
in the direction from P to A. 

Exercise 9. Evaluate dF/dl for F ( P ) = “Angle between OA and OP ”, where 
O and A are two given points, in the direction from P to A . 

Exercise 10. Evaluate dF/dl for F (P) = “Distance from P to the straight line 
that passes through A and 5”, where A and B are given points in the direction 
parallel to AB . The distance between a point P and a straight line is defined as the 
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shortest distance between P and any of the points on the straight line. The same 
definition applies to the distance between a point and a curve. 

Exercise 11. Evaluate dF/dl for F (P) = “Area of triangle PAB”, where A and 
B are fixed points, in the direction parallel to A B . 

Exercise 12. Evaluate dF/dl for F (P) = “Area of triangle PAB ”, where A and 
B are fixed points, in the direction orthogonal to A B . 


2.7 The Gradient 


The concept of the directional derivative leads to the concept of the gradient. The 
gradient V F of F is defined as a vector that points in the direction of the greatest 
increase in F . That is, it points in the direction / along which dF/dl has the 
greatest value. The length of the gradient vector equals the rate of the increase, 
that is |VF| = dF/dl. Note that the symbol V in VF is bold indicating that the 
gradient is a vector quantity. Importantly, the gradient is a geometric concept. For a 
given scalar field, it can be evaluated, at least conceptually, by pure geometric means 
without a reference to a coordinate system. 

Exercise 13. Describe the gradient for each of the functions from the preceding 
exercises. 

The gradient V F and the directional derivative dF/dl along the ray l are related 
by the dot product: 



( 2 . 11 ) 


where L is the unit vector in the direction of the ray / . This is a powerful relationship 
indeed and perhaps somewhat unexpected. It shows that knowing the gradient is 
sufficient to determine the directional derivatives in all directions. In particular, the 
directional derivative is zero in any direction orthogonal to the gradient. 

If we were to impose a coordinate system on the Euclidean space (doing so 
would be very much against the geometric spirit of this chapter), we would see 
equation (2.1 1) as nothing more than a special form of the chain rule. This approach 
to equation (2.11) is discussed in Chap. 6 after we have built a proper framework for 
operating safely in coordinate systems. Meanwhile, you should view equation (2.11) 
as a geometric identity 

Exercise 14. Give the geometric intuition behind equation (2.11). In other words, 
explain geometrically, why knowing the gradient is sufficient to determine the 
directional derivatives in all possible directions. 
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2.8 Differentiation of Vector Fields 

In order to be subject to differentiation, a field must consist of elements that posses 
three basic properties. First, the elements can be added together to produce another 
element of the same kind. Second, the elements can be multiplied by real numbers. 
Can geometric vectors be added together and multiplied by real numbers? Certainly, 
yes. So far, geometric vectors are good candidates for differentiation with respect to 
a parameter. 

The third property is the ability to approach a limit. The natural definition of 
limits for vector quantities is based on distance. If vector A depends on a parameter 
h, we say that A (/;) — > B as h — > 0, if 

lint I A (h) — B| = 0. (2.12) 

h -*0 

In summary, geometric vectors posses the three required properties. Thus, geometric 
vectors can be differentiated with respect to a parameter. 

Consider the radial segment on the unit circle and treat it as a vector function 
R (a) of the angle a. In order to determine R' (a), we appeal to the definition (2.8). 
We take ever smaller values of h and construct the ratio (R (a + h ) — R (a)) / h 
in Fig. 2.3. The figure shows the vectors R (a) and R(a + /;), their diminishing 
difference AR = R(a + h) — R(a), as well as the quantity AR/h. It is apparent 
that the ratio AR / h converges to a specific vector. That vector is R' (a). 



Fig. 2.3 A limiting process that constructs the derivative of the vector R (a) with respect to a 


2.9 Summary 
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In practice, it is rare that essentially geometric analysis can produce an answer. 
For example, from the example above, we may conjecture that R' (a) is orthogonal 
to R' (a) and of unit length, but it is nontrivial to demonstrate so convincingly in 
purely geometric terms. Certain elements of the answer can be demonstrated without 
resorting to algebraic analysis in a coordinate system, but typically not all. 

In our current example R' (a) can be determined completely without any 
coordinates. To show orthogonality, note that R(a) is the unit length, which can 
be written as 


R(a) • R (a) = 1. (2.13) 

Differentiating both sides of this identity with respect to a, we have, by the product 
rule 


R' {a) ■ R (a) + R (a) • R' (ct) = 0, (2.14) 

where we have assumed, rather reasonably but without a formal justification, that 
the derivative of a scalar product satisfies the product rule. We therefore have 

R (a) • R' (a) = 0, (2.15) 


showing orthogonality. 

To show that R' (a) is unit length, note that the length of the vector R (a + h) — 
R (a) is 2 sin |. Thus the length of R' (a) is given by the limit 


lim 
h—> 0 


2 sin | 
h 


= 1 


(2.16) 


and we have confirmed that R' (a) is orthogonal to R (a) and is unit length. In 
the next chapter, we rederive this result with the help of a coordinate system as an 
illustration of the utility of coordinates. 

Exercise 15. Show that the length of R (a + h) — R (a) is 2 sin |. 

Exercise 16. Confirm that the limit in (2.16) is 1 by L’Hopital’s rule. 

Exercise 17. Alternatively, recognize that the limit in (2.16) equals sin ; 0 = 
cos 0=1. 

Exercise 18. Determine R" (a). 


2.9 Summary 

In this chapter, we introduced the fundamental concepts upon which the subject 
of tensor calculus is built. The primary elements defined in the Euclidean space 
are scalar and vector fields. Curve segments, two-dimensional surface patches, 
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and three-dimensional domains are characterized lengths, areas, and volumes. This 
chapter dealt with geometric objects that can be discussed without a reference 
to a coordinate system. However, coordinate systems are necessary since the 
overwhelming majority of applied problems cannot be solved by geometric means 
alone. When coordinate systems are introduced, tensor calculus serves to preserve 
the geometric perspective by offering a strategy that leads to results that are 
independent of the choice of coordinates. 



Chapter 3 

Coordinate Systems and the Role 
of Tensor Calculus 


3.1 Preview 

Tensor calculus was invented in order to make geometric and analytical methods 
work together effectively. While geometry is one of the oldest and most developed 
branches of mathematics, coordinate systems are relatively new, dating back to the 
1600s. The introduction of coordinate systems enabled the use of algebraic methods 
in geometry and eventually led to the development of calculus. However, along with 
their tremendous power, coordinate systems present a number of potential pitfalls, 
which soon became apparent. Tensor calculus arose as a mechanism for overcoming 
these pitfalls. In this chapter, we discuss coordinate systems, the advantages and 
disadvantages of their use, and explain the need for tensor calculus. 


3.2 Why Coordinate Systems? 

Coordinate systems make tasks easier! The invention of coordinates systems was 
a watershed event in seventeenth-century science. Joseph Lagrange (1736-1813) 
described the importance of this event in the following words: “As long as algebra 
and geometry have been separated, their progress has been slow and their uses 
limited, but when these two sciences have been united, they have lent each mutual 
forces, and have marched together towards perfection.” Coordinate systems enabled 
the use of powerful algebraic methods in geometric problems. This in turn led 
to the development of new algebraic methods inspired by geometric insight. This 
unprecedented wave of change culminated in the invention of calculus. Prior to 
the invention of coordinate systems, mathematicians had developed extraordinary 
skill at solving problems by geometric means. The invention of coordinates and the 
subsequent invention of calculus opened up problem solving to the masses. 

The individual whose name is most closely associated with the invention of 
coordinate systems is Rene Descartes (1596-1650), whose portrait is found in 
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Fig. 3.1 Rene Descartes is 
credited with the invention of 
coordinate systems. His use 
of coordinates was brilliant 



Fig. 3.1. We do not know if Descartes was the first to think of assigning numbers 
to points, but he was the first to use this idea to incredible effect. For example, the 
coordinate system figured prominently in Descartes’ tangent line method, which he 
described as “Not only the most useful and most general problem in geometry that 
I know, but even that I have ever desired to know.” The original description of the 
method may be found in Descartes’ masterpiece The Geometry [9]. An excellent 
historical perspective can be found in [15]. 

An example from Chap. 2 provides a vivid illustration of the power of coor- 
dinates. In Sect. 2.8, we considered a vector R(o;) that traces out the unit as the 
parameter a changes from 0 to 2n. We concluded, by reasoning geometrically, 
that the derivative R' (a) is unit length and orthogonal to R(a). Our derivation 
took a certain degree of geometric insight. Even for a slightly harder problem — 
for example, R (a) tracing out an ellipse instead of a circle — R' (a) would be much 
harder to calculate. 

With the help of coordinates, the same computation becomes elementary and 
even much harder problems can be handled with equal ease. Let us use Cartesian 
coordinates and denote the coordinate basis by {i, j}. Then, R (a) is given by 

R («) = icosa + j sina, (3.1) 

which yields R' (a) by a simple differentiation: 


R ; (a) = — i sina + j cos a. 


(3.2) 



3.3 What Is a Coordinate System? 
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That’s it: we found R' (a) in one simple step. To confirm that R' (a) is orthogonal 
to R (a), dot R' (a) with R (a): 

R' (a) ■ R (a) = — sin a cos a + sin a cos a = 0. (3.3) 

To confirm that R' (a) is unit length, compute the dot product 


R' (a) • R' (a) = sin 2 a + cos 2 a = 1 . (3.4) 

Even this simple example shows the tremendous utility of operating within a 
coordinate system. On the other hand, coordinates must be handled with skill. The 
use of coordinates comes with its own set of pitfalls. Overreliance of coordinates is 
often counterproductive. Tensor calculus was born out of the need for a systematic 
and disciplined strategy for using coordinates. In this chapter, we will touch upon 
the difficulties that arise when coordinates are used inappropriately. 


3.3 What Is a Coordinate System? 

A coordinate system assigns sets of numbers to points in space in a systematic 
fashion. The choice of the coordinate system is dictated by the problem. If a problem 
can be solved with the help of one coordinate system it may also be solved with 
another. However, the solution may be more complicated in one coordinate system 
than the other. For example, we can refer the surface of the Earth to spherical 
coordinates (described in Sect. 3.6.5). When choosing a spherical coordinate system 
on a sphere, one needs to make two decisions — where to place the poles and where 
the azimuth count starts. The usual choice is a good one: the coordinate poles 
coincide with the North and South poles of the Earth, and the main meridian passes 
through London. 

This coordinate system is convenient in many respects. For example, the length 
of day can be easily determined from the latitude and the time of year. Climate 
is very strongly tied to the latitude as well. Time zones roughly follow meridians. 
Centripetal acceleration is strictly a function of the latitude. It is greatest at the 
equator (6 = tv/2). 

Imagine what would happen if the coordinate poles were placed in Philadelphia 
( 6 = 0) and the opposite pole (6 = n) in the middle of the ocean southwest of 
Australia? Some tasks would become easier, others harder. For example, calculating 
the distance to Philadelphia would become a very simple task: the point with 
coordinates (0,<p) is R6 miles away, where R is the radius of the Earth. On the 
other hand, some of the more important tasks, like determining the time of day, 
would become substantially more complicated. 
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3.4 Perils of Coordinates 

It may seem then, that the right strategy when solving a problem is to pick the right 
coordinate system. Not so! The conveniences of coordinate systems come with great 
costs including loss of generality and loss of geometric insight. This can quite often 
be the difference between succeeding and failing at solving the problem. 


Loss of Geometric Insight 

A famous historical episode can illustrate how an analytical calculation, albeit 
brilliant, can fail to identify a simple geometric picture. Consider the task of finding 
the least surface area that spans a three-dimensional contour U . Mathematically, the 
problem is to minimize the area integral 

A = J dS (3.5) 

over all possible surfaces S for which the contour boundary U is specified. Such 
a surface is said to be minimal. The problem of finding minimal surfaces is a 
classical problem in the calculus of variations. Leonhard Euler laid the foundation 
for this subject in 1744 in the celebrated work entitled The method of finding plane 
curves that show some property of maximum and minimum [12]. Joseph Lagrange 
advanced Euler’s geometric ideas by formulating an analytical method in Essai 
d’une nouvelle methode pour determiner les maxima et les minima des formules 
integrates indefinies [26] published in 1762. Lagrange derived a relationship that a 
surface in Cartesian coordinates (x, y, z) represented by 

z = F (x, y) (3.6) 

must satisfy. That relationship reads 

F XX + Fyy + F xx F 2 + Fyy F 2 - 2 F x F y F xy = 0. (3.7) 

The symbols F xx denotes d 2 F/dx 2 and the rest of the elements follow the same 
convention. In the context of calculus of variations, equation (3.7) is known as the 
Euler-Lagrange equation for the functional (3.5). 

What is the geometric meaning of equation (3.7)? We have the answer now but, 
as hard as it is to believe, neither Lagrange nor Euler knew it! The geometric insight 
came 14 years later from a young French mathematician Jean Baptiste Meusnier. 
Meusnier realized that minimal surfaces are characterized by zero mean curvature. 
We denote mean curvature by B“ and discuss it thoroughly in Chap. 12. For a 
surface given by (3.6) in Cartesian coordinates, mean curvature B“ is given by 
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Fxx + Fyy + F XX F l + FyyFl - 2 F x F y F Xy 

B a = 7372 ' ^ 3 - 8 ) 

(l + FI + F*) 

Therefore, as Meusnier discovered, minimal surfaces are characterized by zero 
mean curvature: 


B a a = 0. (3.9) 

The history of calculus of variations is one of shining success and has bestowed 
upon its authors much deserved admiration. Furthermore, we can learn a very 
valuable lesson from the difficulties experienced even by the great mathematicians. 
By choosing to operate in a particular coordinate system, Lagrange purposefully 
sacrificed a great deal of geometric insight in exchange for the power of analytical 
methods. In doing so, Lagrange was able to solve a wider range of problems than 
the subject’s founder Euler. 


Analytic Complexity 

As we can see, coordinate systems are indispensable for problem solving. However, 
the introduction of a particular coordinate system must take place at the right 
moment in the solution of a problem. Otherwise, the expressions that one encounters 
can become impenetrably complex. The complexity of the expression (3.8) for mean 
curvature speaks to this point. Lagrange was able to overcome the computational 
difficulties that arose in obtaining equation (3.7). Keep in mind, however, that 
equation (3.7) corresponds to one of the simplest problems of its kind. Imagine 
the complexity that would arise is the analysis of a more complicated problem. 
For example, consider the following problem that has important applications for 
biological membranes: to find the surface S with a given contour boundary U that 
minimizes the integral of the square of mean curvature: 

E = J {B “) 2 dS. (3.10) 

In Cartesian coordinates, E is given by the expression 

f (F XX + Fyy + F XX F^ + FyyFl ~ 2 F X FyF X y) 

E = / 2 3 —dxdy. (3.11) 

h (l + F 2 + F$) 

As it is shown in Chap. 17, the Euler-Lagrange equation corresponding to equa- 
tion (3.10) reads 


V^B a a - - 2 = ° 


(3.12) 
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Fig. 3.2 The Laplacian of mean curvature in Cartesian coordinates 


The first term in this equation is V/j V'” B“, the surface Laplacian of mean 

curvature 5“. The surface Laplacian is discussed in Chap. 11. Let us now try to 
imagine the mayhem that the Euler-Lagrange equation would be if the standard 
analysis of calculus of variations were applied directly to equation (3.11). The 
expression for V^5“, a relatively simple geometric quantity, is shown in Fig. 3.2. 
Despite its apparent impenetrability, it is endowed with geometric meaning: it is 
the Laplacian of mean curvature. The Laplacian of mean curvature is an important 
object that arises often in the analysis of surfaces. We shall denote it by V„V“S^. It 
is hard to image that someone would carry their analysis this far! If one did, it is even 
harder to imagine that someone would be able to recognize this expression as the 
Laplacian of mean curvature! These unmanageable expressions arise because the 
formulation of the minimization problem (3.1 1) in a Cartesian coordinate system — 
or any other particular coordinate system, for that matter — is ineffective. 
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3.5 The Role of Tensor Calculus 


The central idea of tensor calculus is to acknowledge the need for coordinate 
systems and, at the same time, to avoid selecting a particular coordinate system 
for as long as possible in the course of solving a problem. Tensor calculus provides 
algorithms for constructing analytical expressions that are valid in all coordinate 
systems simultaneously. You will soon learn that tensor expressions, such as 
translate to consistent and detailed recipes for calculating geometric quantities 
in particular coordinate systems. If an expression evaluates to the same value in 
all coordinate systems, then it must be endowed with geometric meaning. Tensor 
calculus has proven to be a remarkably effective framework in which Lagrange’s 
mutual forces can be lent between algebra and geometry. 


3.6 A Catalog of Coordinate Systems 

In this section, we describe the most commonly used coordinates systems in 
Euclidean spaces: Cartesian, affine, polar, cylindrical and spherical. A good way 
to illustrate a coordinate system graphically is by drawing the coordinate lines in 
two dimensions (e.g. Fig. 3.3), and coordinate surfaces in three dimensions (e.g. 
Fig. 3.4). A coordinate line or surface is a set of points for which the value of a 
particular coordinate is fixed. For example, in Cartesian coordinates x, y, z in three 



Z 1 =0 



A 

'B 


Z 2 = 5 Z 2 = 4 Z 2 = 3 Z 2 = 2 Z 2 = 1 


Fig. 3.3 Coordinate lines for a typical curvilinear coordinate system 
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Fig. 3.4 Spherical coordinates. The plot on the right shows an example of a coordinate surface for 
each variable 


dimensions, the coordinate surface corresponding to x = 1 is a plane parallel to 
the jz-plane. In spherical coordinates, the coordinate surface r = 2 is a sphere of 
radius 2 centered at the origin. 

The figures below show coordinate lines corresponding to integer values of 
the fixed coordinate. In other words, the display coordinate lines are spaced one 
coordinate unit apart. When a Euclidean space is referred to a coordinate system, 
the term unit becomes ambiguous: it may refer to a coordinate unit or a unit of 
Euclidean length. For example, in Fig. 3.3 depicting a generic coordinate system 
Z 1 , Z 2 , the points A and B have coordinates (—1,2) and (—2, 2) and are therefore 
one coordinate unit apart. However, the Euclidean distance between A and B may 
be quite different. 

In all likelihood, you are already familiar with the most common coordinate 
systems described below. Nevertheless, we invite you to read the following sections, 
since we describe each coordinates system in absolute terms, rather than (as done in 
most texts) by a reference to a background Cartesian grid. This absolute approach 
is more true to the spirit of tensor calculus. 


3.6.1 Cartesian Coordinates 

We start with the legendary Cartesian coordinates. Cartesian coordinates are the 
most used and the most overused coordinate system. It is a natural choice in many 
situations and, in an a number of ways, the easiest coordinate system to use. It 
is also an unnatural choice in many situations where it is used anyway, especially 
those situations where one need not choose a coordinate system at all. It is one of 
the goals of this book to rid the reader of the acquired dependency on Cartesian 
coordinates. 

Cartesian coordinates x, y refer the Euclidean plane to a regular square grid with 
coordinate lines one Euclidean unit apart, as illustrated in Fig. 3.5. A point with 
coordinates x = xo and y = Vo is denoted by (xq, yo). The point 0 with coordinates 
(0, 0) is called the origin. 
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Fig. 3.5 Two different Cartesian coordinate systems 


Cartesian coordinates (as well as affine coordinates described next) can be 
characterized by a natural coordinate basis i.j. The vector i connects the origin 
to the point with coordinates (1,0) while the vector j connects the origin to the 
point with coordinates (0, 1). A vector V connecting the origin to the point with 
coordinates (xo, .Vo) can be expressed by the linear combination 

V = x 0 i + Vo j- (3.13) 

Thus, a Cartesian coordinate yields an orthonormal basis i, j for vectors in the plane. 
Conversely, any orthonormal basis i.j, combined with the location of the origin, 
corresponds to a Cartesian coordinate system. 

The requirement that the coordinate unit equals the Euclidean unit of length is 
essential to the definition of a Cartesian coordinate system. A similarly constructed 
coordinate systems with integer coordinate lines two Euclidean units of length apart 
would no longer be considered Cartesian. It would instead be characterized as an 
orthogonal affine coordinate system. 

There are infinitely many Cartesian coordinate systems in the plane. A Cartesian 
coordinate system is characterized by three continuous degrees of freedom: the 
location of the origin and direction of the x-axis. Furthermore, the role of axes can 
be reversed. That is, i and j can be chosen such that the shortest path from i to j is 
either counterclockwise or clockwise. In the former case, the coordinates are called 
right-handed, in the latter — left-handed. 

Exercise 19. Generalize the contents of the section to three dimensions. 

Exercise 20. Describe a set of six degrees of freedom in choosing Cartesian 
coordinates in the three-dimensional space. 

The concept of orientation extends to arbitrary dimensions. In three dimensions, 
a Cartesian coordinate system is called right-handed if the coordinate triplet i.j, k 
satisfies the right-hand rule: when one curls the fingers of the right hand from the 
vector i to the vector j, the thumb points in the direction of vector k. If the thumb 
points in the direction opposite of the vector k, the coordinate system is called 
left-handed. 
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Fig. 3.6 An affine coordinate 
system 



3.6.2 Affine Coordinates 


Affine coordinates are characterized by a skewed but otherwise regular grid of 
coordinate lines, as in Fig. 3.6. The easiest way to introduce affine coordinates is 
by selecting an origin O and an arbitrary triplet of vectors i, j, k which may point in 
any direction and have any length. To determine the affine coordinates of a point A, 
consider a vector Y = OA and decompose V with respect to the basis i, j, k 

V = x 0 i + jo j + Zok. (3.14) 

Then the point A is assigned coordinates xo, yo, Zo- 

The triplet i, j, k is the coordinate basis. The vector i points from the origin to 
the point with coordinates (1,0,0), just as in the case of Cartesian coordinates. 
Cartesian coordinates are a special case of affine coordinates with an orthonormal 
coordinate basis i.j.k. Any two affine coordinate systems are related by an 
invertible linear transformations with a shift. In particular, any affine coordinate 
system is related to a Cartesian coordinate system by a linear transformation with a 
shift. The last statement may also be taken as the definition of Cartesian coordinates. 

Affine coordinates can also be referred to as rectilinear coordinates , that is, 
coordinate systems where the coordinate lines are straight. Coordinate systems 
where the coordinate lines are not straight are called curvilinear. We now turn to 
the description of the most fundamental curvilinear coordinates. 


3.6.3 Polar Coordinates 

The construction of a polar coordinate system in the plane is illustrated in Fig. 3.7. 
An arbitrary point O is designated as the pole or origin and a ray emanating from 
O is selected. This ray is known as the polar axis. Each point A in the plane is 
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Fig. 3.7 A polar coordinates 
system 



assigned two coordinates, r and 9, where r is the Euclidean distance from the 
pole and 9 is the signed angle that the segment OA forms with the polar axis. 
According to the typical sign convention, the angle 9 is positive if the angle is 
formed in the counterclockwise direction. Each point in the plane is characterized 
by a unique value of r, but infinitely many values of 9. That is, coordinates (r. 9) 
and (r, 9 + 2 Jtn) for any integer n refer to the same point in the plane. Finally, the 
angle 9 is not defined at the origin O. 

Notice that we described polar coordinates in absolute terms, rather than 
by referencing some previously established Cartesian coordinate system. This is 
important from two points of view. First, it is very much in the spirit of tensor 
calculus, which proclaims that all coordinate systems are created equal and it is only 
the specifics of the problem at hand that dictate the choice of a particular coordinate 
system. Second, it is good to see an explicit reference to Euclidean distance and 
angle and to realize that polar coordinates are intrinsically linked to the Euclidean 
nature of the plane. In particular, r is measured in Euclidean units. A rescaled polar 
coordinate system would no longer be called polar coordinates , but rather scaled 
polar coordinates. 


3.6.4 Cylindrical Coordinates 

Cylindrical coordinates extend polar coordinates to three dimensions. Cylindrical 
coordinates are constructed as follows. First a plane is selected and a polar 
coordinate system is constructed in that plane. This special plane is referred to 
as the coordinate plane. Then each point A in the Euclidean space is described 
by the triplet of numbers (r, 9,z), where r and 9 are the polar coordinates of the 
orthogonal projection of A onto the coordinate plane and z is the signed Euclidean 
distance between A and the coordinate axis. The sign convention usually follows 
the right-hand rule. The origin O of the polar coordinates in the coordinate plane is 
also called the origin with regard to the cylindrical coordinates. 

The straight line that passes through O and is orthogonal to the coordinate plane 
consists of points for which r = 0. It is known as the cylindrical or longitudinal 



32 


Coordinate Systems and the Role of Tensor Calculus 



Fig. 3.8 Cylindrical coordinates. The plot on the right gives an example of a coordinate surface 
for each variable 


axis. An alternative interpretation of r is the Euclidean distance from this axis. The 
term cylindrical comes from the fact that points described by constant r form a 
cylinder (Fig. 3.8). 


3.6.5 Spherical Coordinates 

Spherical coordinates r,9,<p are perfectly intuitive because the angles 9 and <p 
correspond, respectively, to longitude and latitude on the surface of the Earth, and 
r is the distance to the center of the Earth. A formal construction of spherical 
coordinates starts by selecting a coordinate plane with an origin 0 and a polar axis 
that establishes the azimuthal angle (p. The coordinate r is defined as the Euclidean 
distance to the origin O. The set of points corresponding to a given value of r is a 
sphere of that radius with the center at the origin. Such spheres are called coordinate 
spheres. Fixed r and 6 correspond to points that form a meridian on a coordinate 
sphere of radius r. The final coordinate 9, known as the longitudinal angle, varies 
from 0 to it and gives the angle between OA and the longitudinal axis, a straight 
line orthogonal to the coordinate plane that passes through the origin 0 . The plane 
of the equator corresponds to 9 = it/ 2. Neither angle is determined at the origin O. 
The azimuthal angle (p is undefined along the longitudinal axis. 


3.6.6 Relationships Among Common Coordinate Systems 

Exercise 21. Given a polar coordinate system, introduce Cartesian coordinates that 
originate at the same pole and for which the .r-axis coincides with the polar axis. 
Show that the Cartesian coordinates ( x , y) are related to the polar coordinates (r, 9) 
by the following identities: 


3.6 A Catalog of Coordinate Systems 33 

x (r, 0) = r cos 6 (3.15) 

y(r,8) = r sin 6. (3.16) 

Exercise 22. Show that the inverse relationship is 

r (x, y) = yjx 2 + y 2 (3.17) 

0(x,y) = arctan—. (3.18) 


Exercise 23. Align a Cartesian coordinate system x,y,z with a given cylindrical 
coordinate system in a natural way. Show that x,y,z and r, 9, z (we hope it is OK 
to reuse the letter z) are related as follows 


x (r, 9,z) = r cos 9 
y (r, 9, z) = r sin 9 
z ( r , 9, z) = z. 

Exercise 24. Show that the inverse relationships are 

r (x, y, z) = \/x 2 + y 2 
y 

8 (x, y, z) = arctan — 
x 

z(x,y,z) = z. 


(3.19) 

(3.20) 

(3.21) 

(3.22) 

(3.23) 

(3.24) 


Exercise 25. Align a Cartesian coordinate system with a spherical coordinate 
system in a natural way. Show that x, y, z and r, 9, (f> are related by 


x (r, 9,<f>) = r sin 9 cos (p 
y (r, 8, (j)) = r sin 9 sin <fi 
z(r, 9,(p) = r cos 8. 

Exercise 26. Show that the inverse relationships is 

r (x, y, z) = \/x 2 + y 2 + z 2 
z 

9 (x, y,z ) = arcsin 


(j) (x, y,z ) = arctan — . 


y/x 2 + y 2 + z 2 

y 


(3.25) 

(3.26) 

(3.27) 

(3.28) 

(3.29) 


(3.30) 
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3.7 Summary 

In this chapter we highlighted some of the most important advantages of coordinate 
systems. Of course, the coordinate approach hardly needs a champion. However, 
there are many perils of using coordinate systems that include loss of geometric 
insight and unmanageable complexity of calculations. We will find that tensor 
calculus goes a long way towards overcoming these challenges and thus helps make 
the use of coordinate systems even more powerful. 

There are a handful of coordinate systems that are used in solving problems 
for special geometries. Several of these — Cartesian, affine, polar, cylindrical, and 
spherical — were described in this chapter. 



Chapter 4 

Change of Coordinates 


4.1 Preview 

Tensor calculus achieves invariance by establishing rules for forming expressions 
that evaluate to the same value in all coordinate systems. In order to construct 
expressions that do not depend on the choice of coordinates, one must understand 
how individual elements transform under a change of coordinate. This chapter 
is therefore devoted to the study of coordinate changes and the corresponding 
changes in calculated quantities. Looking ahead, we discover that certain objects 
transform according to a special rule that makes it particularly easy to form invariant 
combinations. Such objects are called tensors. 

In this chapter, we also introduce the essential elements of the tensor notation. We 
begin to use indices in earnest and show their great utility in forming very compact 
and readable expressions. 


4.2 An Example of a Coordinate Change 

Consider the relationship between Cartesian coordinates x, y and polar coordinates 
r. 9 in the plane: 

r (x, y) = y / 'x 2 + y 1 (4.1) 

9 (x, y) = arctan — . (4.2) 

x 

The inverse relationship is 

x (r, 9) = r cos 9 (4.3) 

y (r, 9) = r sin$. (4.4) 


P. Grinfeld, Introduction to Tensor Analysis and the Calculus of Moving Surfaces, 
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4 Change of Coordinates 


We note that in these equations, some letters denote more than one object. In 
equations (4.1) and (4.2), the letters x and y denote independent variables while r 
and 9 denote functions of x and y. In equations (4.3) and (4.4), the letters x and y 
denote functions or r and 9, while r and 9 are independent variables. One may find 
it helpful to use new letters, / and g, to denote these functions: 


r = f (x, y) = \j x 1 + y 2 (4.5) 

9 = g (x, y) = arctan — (4.6) 

x 

and two more letters, h and /, for the inverse relationships: 

x = h (r, 9) = r cos 9 (4.7) 

V = i ( r , 9) = r sin 9 (4.8) 


We do not use extra letters in our discussion. Instead, we use another notational 
device for distinguishing between objects such as /'-the- independent- variable and 
r-the-function-cf-A-and-y: when we mean r-the-function, we include the functional 
arguments and write r (a, y). That makes it clear that we are talking about r-the- 
function rather than r-the-independent-variable. 


4.3 A Jacobian Example 


Let us construct the matrix J (x, y), often denoted in literature by 3 (r, 9) /3 (a, y), 
which consists of the partial derivatives of the functions r (x, y) and 9 (a, y ) with 
respect to each of the Cartesian variables: 


J (x, y ) 


3r(x,j) dr(x,y) 

3jc dy 

d 6(x,y) d 0(x,y) 

dx dy 


(4.9) 


This matrix is called the Jacobian of the coordinate transformation. Evaluating the 
partial derivatives, we find 


J (x, y) 


X 

-y 

x 2 +y 2 


y 

X 

x - 2 +y 2 


(4.10) 


Next, calculate the Jacobian J' for the reverse transformation of coordinates: 


3 x(r,6) dx(r,0 ) 

dr 3 6 

3 y(r,6) 3 y(r,6) 

dr 3 6 


J ' (r, 9 ) = 


(4.11) 
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Evaluating the partial derivatives, we find 


J' = 


cos 9 
sin 9 


—r sin 9 
r cos 9 


(4.12) 


Exercise 27. Calculate the determinant | / 1 . 

Exercise 28. Evaluate J at the point with Cartesian coordinates x = 1, y = 1? 

Exercise 29. Calculate the determinant | / ' | . How does it relate to | / 1 ? 

Exercise 30. Evaluate J' at the point with polar coordinates r = \/2, 9 = n/A 
(which is the same as the point with Cartesian coordinates x = 1, V = 1). 

Exercise 31. Evaluate the matrix product 

JJ' (4.13) 


at the point with Cartesian coordinates x = 1 , y = 1 or, equivalently, polar 
coordinates r = V 2, 9 = it I A. Did your calculation yield the identity matrix? 


4.4 The Inverse Relationship Between the Jacobians 


In this section, we find that the Jacobians J and J' evaluated at the same point in 
space are matrix inverses of each other: 


JJ' = 


1 0 
0 1 


(4.14) 


In order to carry out the multiplication, we must express both Jacobians in the 
same coordinates. We have treated J (x, y) as a function of x and y and J' (r, 9) as 
a function of r and 9. But you may also think of J and ./ ' as functions of points in 
the Euclidean space. As such, they may be expressed in either coordinates. Express 
the Jacobian J (x, y ) in polar coordinates: 



r *(r,0) 

y(r.9) 1 


cos 9 sin 9 

sin 9 cos 6 

J (r, 9) = 

*Jx(r,0?+y(r,O) 2 

-y(r,0) 

V A-(r,6>) 2 +vM) 2 

= 


x(r,0) 2 +y(r,9) 2 

x(r,6) 2 +y{r,9) 2 _ 


r r - 


and evaluate the matrix product JJ': 


cos 9 

sin 9 


cos 9 —r sin 9 


"l o" 

sin# 

r 

cos 0 
r 


sin 9 r cos 9 


_° l . 
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As predicted, the result is the identity matrix. The remainder of this chapter 
is devoted to demonstrating that this special relationship is holds for arbitrary 
coordinate changes. This task gives us an opportunity to introduce the key elements 
of the tensor notation. 

Exercise 32. Show that J and J' are the inverses of each other by transforming 
J' (r, 6) to Cartesian coordinates. 

Exercise 33. In three dimensions, analyze the transformation from Cartesian to 
spherical coordinates and confirm that the associated Jacobians are matrix inverses 
of each other. 


4.5 The Chain Rule in Tensor Notation 


Suppose that F is a function of three variables a, b, and c : 

F = F(a,b,c). (4.17) 

Suppose further, that we have three functions of two variables /x and v: 

A(/x,v) (4.18) 

B(n,v) (4.19) 

C (/x, v) . (4.20) 

Form a new function / (/x, v) by composition 

/ 0c v) = F (A (/x, v) , B (/x, v) , C (/x, v)) . (4.21) 


Use the chain rule to evaluate the derivative of / (/x, v) with respect to each of the 
arguments: 


9 / (/ 4 . v) _ 

3/x 

9 / (B'V) = 
dv 

Exercise 34. Evaluate 

3/x 2 


3 F 9 A 3 F 3 B 3 F 3 C 

3 a 3/x db 3/x 3c 3/x 

3 F 3 A 3 F 3 B 3 F 9 C 
3a 3u db dv 3c 3v 

9 2 /(/x,v) 9/ 2 (/x, v) 

3/x3v ’ “ 3v 2 ' 


(4.22) 

(4.23) 


(4.24) 


Produce expression similar to equations (4.22) and (4.23). 
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Exercise 35. Derive the third-order derivative <f\f ( p , v) / d 2 pdv. 

We now take several decisive steps towards converting equations (4.22) 
and (4.23) to tensor form. The first step is to rename the variables a, b, and c into 
a 1 , a 2 , and a 3 . Collectively, we refer to these variables as a 1 . Note that we use an 
upper index, or superscript, and that it denotes enumeration and not exponentiation. 

The fact that F is a function of a 1 , a 2 , and a 3 can be expressed in one of three 
ways: 


F = F (a 1 , a 1 , a 3 ) 

(4.25) 

F = F ( a ') 

(4.26) 

F = F (a) . 

(4.27) 


We prefer equation (4.27). That is, we represent the entire collection of arguments 
by a single letter and suppress the superscript i. It is important not to think of 
the symbol a as a vector with components a 1 , a 2 , and a 3 . (A vector is a directed 
segment, after all, not a triplet of numbers). Rather, F (a) is a shorthand notation 
for F (a 1 ,a 2 , a 3 ). 

The next step in converting equations (4.22) and (4.23) to tensor form is to use 
the symbols A 1 , A 2 , and A 3 for the three functions A, B, and C : 


A 1 (p, v) 

(4.28) 

A 2 (p,v) 

(4.29) 

A 3 (p, v) . 

(4.30) 


Collectively, these functions can be denoted by A' (p, v). Once again, we use a 
superscript for enumeration. 

The function / (p, v ), constructed by composing F (a) — that is F (a 1 , a 2 , a 3 ), 
with A' (p, v) — that is A 1 (p, v). A 2 (p, v) and A 3 (p, v ), can now be expressed by 
the following compact equation: 

/ (p, v) = F (A (p, v)) . (4.31) 

When expanded, this expression reads 

/ (p, v) = F ( A 1 (p, v ) , A 2 (p, v) , A 3 (p, v)) . (4.32) 

With the use of superscripts, the equations (4.22) and (4.23) representing the 
chain rule, can be rewritten more compactly way with the help of the summation 
symbol 
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Fig. 4.1 Albert Einstein 
(1879-1955) was a celebrated 
twentieth-century scientist 
and one of the earliest 
proponents of tensor calculus. 
Einstein suggested to 
suppress the summation sign 
in tensor expressions 



3/ (M* v ) 

dp 

3/ 0^ v) 

3v 


i = l 


dF dA‘ 
da' dp 

(4.33) 

dF 3 A‘ 

da 1 dv 

(4.34) 


We are well on our way towards tensor notation. In the expression dF /da' , the 
index i is a lower index , or subscript, since it is a superscript in the denominator. If 
the expression dF/da' were denoted by a special symbol, such as V,- F or F\, , the 
index would necessarily appear as a subscript. 

The next important element is the elimination of the summation sign. According 
to the Einstein notation (or the Einstein summation convention), summation is 
implied when an index appears twice, once as a subscript and once as a superscript. 
Albert Einstein whose portrait appears in Fig. 4.1 suggested the convention in 1916 
[10] and later joked in a letter to a friend [25]: I have made a great discovery in 
mathematics; I have suppressed the summation sign each time that the summation 
must be made over an index that occurs twice. . . 

With the Einstein convention, equations (4.33) and (4.34) become 


3/ (p.v) _ 

dF 3 A' 

(4.35) 

dp 

da' dp 

3/ (M. v) _ 

dF dA' 

(4.36) 

dv 

da' 3u 


The repeated index i, which implies summation, is called a dummy index, or a 
repeated index, or a contracted index. Summation over a dummy index is called 
a contraction. 
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Exercise 36. Derive the expressions for 

d 2 / (/4.v) 3 2 / (/L v) df (H'V) 
3/z 2 ’ 3/x3v ’ dv 2 


(4.37) 


similar to equations (4.35) and (4.36). 

The final step in converting equations (4.22) and (4.23) to tensor form is to 
combine them into a single expression. Denote the independent variables /x and 
v by /x 1 and /z 2 , or collectively /i u . We utilize a letter from the Greek alphabet 
because this index assumes values 1, 2, compared to the values 1,2,3 assumed by 
Latin indices. 

The fact that / is a function of /z 1 and /x 2 can also be expressed in one of three 
ways. 



(4.38) 

/ GO 

(4.39) 

/ (m) > 

(4.40) 


and we once again choose the third way, where the index of the functional argument 
is dropped. Similarly, the fact that each A' is a function of /z 1 and /z 2 is denoted by 
A'' (/i). 

The composition by which / (/z) is formed can now be expressed quite concisely 

/ (/x) = F (A (,x)) . (4.41) 

Note that we suppressed both indices of functional arguments. The compact 
equation (4.41) in actuality represents 

/(/z'./z 2 ) = F (A 1 A 2 (ix 1 ,^ 2 ). A 3 (^,fJi 2 )). (4.42) 

We are now able to capture the two partial derivatives of / by a single elegant 
expression: 


3/ 3 F 3/1 ' 

— — = (4 43) 

3/z“ da’ 3/z" 

Let us review the main features of equation (4.43). First, this single expression 
represents two relationships, one for a = 1 and one for a = 2. This makes a a 
live index. Second, on the right-hand side, there is an implied summation over the 
dummy index i because it is repeated as a subscript and a superscript. 

Exercise 37. Show that the tensor expression for the collection of second-order 
partial derivatives is 
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d 2 f d 2 F 3 A‘ 3 AJ 3 F d 2 A' 

; = 1 . (4.44) 

3/i“ 3 /iP a A 1 dA-i dii a d/i 3 A’ d/i a diJ.P 

How many identities does the resulting tensor expression represent? 

Exercise 38. Derive the tensor expression for the collection of third-order partial 
derivatives 


3 (jl 01 3 3 ’ 

In this section, we introduced the essential features of the tensor notation. Over 
the next few chapters we will discover just how robust the tensor notation is. It is 
capable of expressing a broad range of operations in mathematics, including the 
matrix product and the operation of the transpose. 


4.6 Inverse Functions 


The inverse matrix relationship between the Jacobians J and J' is a consequence 
of the fact that the transformations from Cartesian to polar coordinates and back 
are the functional inverses of each other. Let us first explore the inverse relationship 
between functions of one variable. 

What does it mean for two functions / (x) and g (x) to be the inverses of each 
other? It means that applying f to g (x) yields x: 


g if (*)) = *• 


(4.46) 


From this identity, we can obtain a relationship between the derivatives f and g . 
Differentiating with respect to x yields 

g'(f (*))/'(*) = 1. (4.47) 


or 


/'(*) 


1 

g' if (*)) ’ 


(4.48) 


Thus, with some care, we can state that g' and /' are algebraic inverses of each 
other. We must remember that, in order for this statement to be true, g' must be 
evaluated at / (x) rather than x. For example, the derivative of / (x) = e 2x at 
x = 3 is 2e 6 . Meanwhile, the derivative of g (x) \ = lnx at / (3) = e 6 is 
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Exercise 39. Derive that 


d arccos x 
dx 


= ± 


1 

Vl — x 2 


(4.49) 


by differentiating the identity cos arccos x = x. 

Exercise 40. Suppose that / (x) is the inverse of g (x). Show that the second 
derivatives f" and g" satisfy the identity 

g" (/ W) f (v) 2 + g' (/ (X)) f" (x) = 0. (4.50) 


Exercise 41. Verify equation (4.50) for / (x) = e x and g (x) = lnx. 
Exercise 42. Verify equation (4.50) for / (x) = arccos x and g (x) = cosx. 


Exercise 43. Differentiate the identity (4.50) one more time with respect to x to 
obtain a new expressions relating higher-order derivatives of / (x) and g (x). 


4.7 Inverse Functions of Several Variables 

Equation (4.48) is the prototype of the matrix relationship between the Jacobians 
J and J' . Let us next generalize equation (4.48) for two sets of functions of two 
variables. Following that, we use the tensor notation to generalize to arbitrary 
dimension. 

How does the concept of inverse functions generalize to two dimensions? For 
example, consider the transformations between Cartesian and polar coordinates. 
The two transformations (4.1), (4.2) and (4.3), (4.4) are the inverses of each other. 
That means that substituting x (r, 9) and y (r, 9) into r (x , _y ) will recover r. (As 
an exercise, confirm that this is so.). Similarly, substituting x (r, 9) and y ( r , 9) into 
r (x, y ) will recover 9. Express these relationships analytically: 

r (x (r, 9) , y (r, 9)) = r (4.51) 

9 (x (r, 9) , y (r, 9)) = 9, (4.52) 

More generally, suppose that the sets of functions f(x,y), g(x,y) and 
F (X,Y), G(X,Y) are the inverses of each other. By analogy with (4.51) 
and (4.52), the inverse relationship can be captured by the identities 

F (f (x,y) ,g{x,y)) = x (4.53) 

G{f {x,y),g(x,y)) = y. (4.54) 

From these two identities we can obtain four relationships for the partial derivatives 
of F, G, f and g. These relationships are by differentiating each of the identities 
with respect to each independent variables. Differentiating the first identity (4.53) 
with respect to each variable yields 
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3 F d f 3 F 3 g 
dXdx + dYdx 

3 F 3/ 3F 3 g 
did y + dYdy 


(4.55) 

(4.56) 


Differentiating the second identity (4.54) with respect to each variable yields 


3 G df 3 G 3 g 

dX dx 3 Y dx 

3 G df 3 G dg 

dX 3 y 3F dy 


(4.57) 

(4.58) 


Combining the four identities into a single matrix relationship yields 


"3 F 3 F ~ 


r 3/ V-| 



3X 3 Y 


dx dy 


1 0 

3G 3 G 


3? ag 


0 1 

.ax a y . 


_ dx dy _ 




(4.59) 


which is precisely the relationship we were hoping to discover. It shows that the 
Jacobians J and J' for general coordinate changes are the matrix inverses of each 
another, provided they are evaluated at the same physical point. The one exciting 
task that remains is to generalize this relationship to an arbitrary dimension. We 
will do so in the next section with the help of the tensor notation. You will find 
that the tensor notation actually makes the N -dimensional case simpler than the 
two-dimensional case! 

Exercise 44. What is the equivalent of equations (4.51) where the roles of Cartesian 
and polar coordinates are reversed? 


x (...)= x (4.60) 

y (•••) = y? (4.61) 

Exercise 45. Show that the second-order partial derivatives of F and G, and / and 
g are related by the equation 


3 2 F (df\ 2 d 2 F dfdg 3 2 F /3g\ 2 dF d 2 f dF 3 2 g 
dX 2 { dx J + "3X3 Y dx dx + 3 Y 2 \ dx J + dX dx 2 + d Y dx 2 


Derive the remaining three second-order relationships. 


4.8 The Jacobian Property in Tensor Notation 
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4.8 The Jacobian Property in Tensor Notation 

In this section we generalize the inverse matrix relationship between Jacobians to 
arbitrary dimension. Along the way, we introduce an important new element of the 
tensor notation: the Kronecker delta symbol 8‘- . 

Consider two alternative coordinate systems Z' and Z 1 ' in an /V -dimensional 
space. Notice that we placed the prime next to the index rather than the letter Z. 
Let us call the coordinates Z' unprimed and the coordinates Z' primed. We also 
use the symbols Z' and Z' to denote the functions that express the relationships 
between the coordinates: 


Z i ' = Z i '{Z) (4.63) 

Z' = Z' ( Z ') . (4.64) 

The first expression represents N relationships analogous to equations (4.1) 
and (4.2) and the second represents N relationships analogous to equations (4.3) 
and (4.4). 

As sets of functions, Z' and Z' are the inverses of each other. This fact can be 
expressed either by the identity 


Z‘ (Z' (Z)) = Z' (4.65) 

or by the identity 

Z'' (Z (Z')) = Z 1 ' . (4.66) 

Note all the tensor conventions in play here. The indices of function arguments 
are suppressed. Several relationships are captured by a single expression with a 
live index. Note also, that in the first identity, the symbols Z' (on the left-hand 
side) and Z' (which is Z ! with the index suppressed) represent functions, while the 
symbols Z (which is Z' with the index suppressed) and Z' (on the right-hand side) 
represent the independent variables. Equation (4.65) is analogous to equations (4.53) 
and (4.54) while equation (4.66) is analogous to equations 

/ (F (X, Y) , G (X, Y)) = X (4.67) 

g(F(X,Y),G(X,Y))=Y. (4.68) 

Let us analyze the first identity Z' (Z' (Z)) = Z'. It represents N relationships 
and each of the N relationships can be differentiated with respect to each of the 
N independent variables. This will yield N 2 relationships for the first-order partial 
derivatives of the functions Z' and Z' . 

With the help of the tensor notation, all of the operations can be carried out in a 
single step. We differentiate the identity Z' ( Z' (Z)) = Z' with respect to Z J . It is 
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essential that the differentiation is to take place with respect to Z 7 rather than Z' , 
because our intention is to differentiate each of the identities in Z' (ZJ (Z)) = Z' 
with respect to each of the variables. The resulting expression will have two live 
indices i and j . 

The result of the differentiation reads 

3Z' dZ'' 3Z' 

= (4 69) 

3Z'' dZJ dZJ ' 

This single expression with two live indices i and j is analogous to equa- 
tions (4.55)— (4.58). The objects 3Z' / dZ 1 ' and 9Z' , /3Z 7 are precisely the Jacobians 
of the coordinate transformations. Let us now denote these Jacobians by the indicial 
symbols J 1 , and J’ : 

(4.70) 

(4.71) 

Incorporating these symbols into equation (4.69), it reads 


„ _ 3 Z< (Z 0 
3Z'' 

_ dZ'Uz) 
‘ 3Z' 




3Z' 

dZJ 


(4.72) 


and we must keep in mind that Jl, and J- must be evaluated at the same physical 
point. 

Now, what of the partial derivative 3Z'/3Z 7 ? The symbol Z' on the right-hand 
side of equation (4.65) represents the independent variable or the collection of the 
independent variables. The symbol Z J also represents the independent variable. 
Therefore, 3Z' /3Z 7 is 1 if Z' and Z 7 are the same variable, and 0 otherwise. 
There is a special symbol that denotes such a quantity: the Kronecker delta 81 also 
known as the Kronecker symbol. It is defined as follows 


( 1, if i = j 

8 ) = 

( 0, if i ± j. 


(4.73) 


From the linear algebra point of view, the Kronecker delta can be thought of as the 
identity matrix. It is an object of utmost importance and will play a crucial role in 
the remainder of the textbook. 

With the help of the Kronecker delta, we can rewrite equation (4.72) in its final 
form 


j}'Jj = 8'j. (4.74) 

This equation expresses the inverse relationships between the Jacobians that we have 
set out to demonstrate. 


4.8 The Jacobian Property in Tensor Notation 
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Exercise 46. By differentiating 

Z' v (Z (Z')) = Z'' (4.75) 


show that 


Jl J)> = 8), 


(4.76) 


Exercise 47. Derive equation (4.76) from (4.74) by multiplying both sides by J 1 .,. 

Exercise 48. Introduce the symbols J},., and J'/j for the second-order partial 
derivatives 


• _ 3 2 Z' (Z ') 

l 'j' 3Z''3ZZ 
= 3 2 Z'' (Z) 
ij dZ'dZi ' 


(4.77) 

(4.78) 


Show that 


t > /' [J + ji v 

j J k ^ J i' J j 


1 T j k = o. 


(4.79) 


How many identities does this tensor relationship represent? 

Exercise 49. What is J \, for a transformation from one affine coordinate system 
to another? 

Exercise 50. Derive the identity 


Ji'j'J) + Jl'JjkJy = 0 


(4.80) 


by contracting both sides of equation (4.79) with /*, and subsequently renaming 
j ' -s- k' . Verify this relationship for the transformation from Cartesian to polar 
coordinates. 

Exercise 51. Show that 


j;,j, + = o. (4.8i) 

Equations (4.79), (4.80), (4.81) will prove to be of critical importance in Chap. 8. 

Exercise 52. Derive third-order relationships of the same kind. 

Exercise 53. Consider three changes of variables: from Z' to Z' to Z l " and back 
to Z' . Show that 





(4.82) 
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4.9 Several Notes on the Tensor Notation 

We have already encountered many elements of the tensor notation — and seen many 
of its benefits. In this section, we point out a few properties of the notation that have 
already been used in this chapter, but could be clarified further. 


4.9.1 The Naming of Indices 

Indices are frequently needed to be renamed depending on the context in which an 
object is being used. Sometimes the name of the index does not matter at all. For 
example, the collection of the partial derivatives of a scalar field F can be referred 
to as dF/dZ' or 3 F / dZ J — any letter (preferably a lowercase letter from the English 
alphabet) can be used to denote the index. Similarly, the Jacobian J‘, can be called 
Jj, or J‘, or J., — the names of indices do not matter until the object is used in 
an expression. When referring to an object T ‘ , we may call it T ! or T k or T" 
depending on what fits the current context. 

A repeated index always denotes contraction. In particular, a repeated index must 
appear exactly twice, once as upper and once as lower. For example, the collection 
of second-order partial derivatives 


3 2 F 

dZ‘ dZ i 


cannot be denoted by 


3 2 F 

3Z' 3Z' ' 


(4.83) 


(4.84) 


This expression could be interpreted as a second-order derivative with respect to the 
same variable, for instance, d 2 F/dr 2 and d 2 F/d6 2 . However, tensor calculus does 
not interpret it this way. By convention, objects with repeated indices are disallowed, 
except in a contraction. 

When objects are combined to form longer expressions, renaming indices 
becomes more restricted. Given a contraction, such as 


3 F 3Z' 

3 ^ 3 ^’ 

the index i, if renamed, should be renamed in both instances: 


(4.85) 


3F 3Z' 3 F dZ-i 


3Z' 3 dZJ 3/i“ ' 


(4.86) 
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The repeated index indicating contraction can be renamed arbitrarily, as long as the 
new name does not appear in another part of the term, causing a clash. For example, 
in the expression 


dF dZ‘ dF dZJ 


(4.87) 


the index i can be changed to any letter, except j ! 

Finally, in tensor identities, live indices must be renamed consistently. For 
example, suppose that 


3 F dZ i 

dzJJj?' 


(4.88) 


Then a can be changed to f J > on both sides of the identity: 


a ^ 8zj 

dzJJ^p' 


(4.89) 


Notice that, for good measure, we also renamed the dummy index from i to j . 


4.9.2 Commutativity of Contractions 

We have already encountered several expressions that include two or more contrac- 
tions. One such expression was Jl,J l j k Jy. Should we have been concerned with the 
order of contractions? The answer is no because contractions commute, that is, they 
can be carried out in any order. 

For an illustration, consider an object T ’! J , with four indices and the double 
contraction T‘j . The expression T‘j can be interpreted in three ways. First, we can 
imagine that contraction on i takes place first, yielding the object Uj = Tjj . Fully 
expanding, Uj is the sum 


uj = Tf + T* j + T* J . (4.90) 

Subsequently, contraction on the remaining indices yields Uj = T‘j : 

Tjj = Tjl + Til + Til + Til + Til + Til + + 7#. (4.91) 

Alternatively, we can contract on j first (yielding a temporary object Vj = Tj 1 . ) 
and subsequently on i . You can confirm that this sequence will result in the same 
sum. Finally, the contractions can be viewed as simultaneous. That is, the terms 
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in the sum are formed by letting the indices i and j run through the N 2 possible 
combinations. It is clear that in this case the end result is also the same. 

4.9.3 More on the Kronecker Symbol 

The Kronecker delta Sj will prove to be an extremely important object. Its 
importance goes beyond its linear algebra interpretation as the identity matrix. For 
now, we would like to say a few words about its index-renaming effect in indicial 
expressions. 

Consider the expression T/8j . What is the value of this object for j = 1? It is 

Tj8[ = 7j<$} + T 2 S] + T 3 8\. (4.92) 

The only surviving term is T\8\ which equals 7j. Similarly for j = 2, TjSj is T 2 . 
And so forth. Summarizing, 

TjS'j = Tj. (4.93) 

Thus, the effect of contracting 7} with S'- is to rename the index i to j . Similarly, 

V = T j . (4.94) 

The last two equations show why the Kronecker delta can also be viewed as the 

identity matrix: when contracted with an object, the result is the original object. 

Exercise 54. Simplify the expression A S‘ k . 

Exercise 55. Simplify the expression S'j S J k . 

Exercise 56. Evaluate the expression S'j Sj . 

Exercise 57. Evaluate the expression 8) Sj . 

Exercise 58. Simplify the expression Sj S, S J . 

Exercise 59. Show that the object SjSj —SjSj is skew-symmetric in i and j as well 
as in r and s. 

4.10 Orientation-Preserving Coordinate Changes 

Let J be the determinant of the Jacobian We express this relationship in the 
following way 

/ = \ Jj 


(4.95) 



4. 1 1 Summary 
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The coordinate change is called orientation-presen’ing if J >0. The coordinate 
change is called orientation-reversing if J <0. These definitions are local: the 
orientation-preserving property may vary from one point to another. Since the 
Jacobians and J- are matrix inverses of each other, the determinant of J‘ is 
J~'. Therefore, if a change of coordinates is orientation-preserving, then so is its 
inverse. The orientation-preserving property of coordinate changes is particularly 
important for the discussion of the volume element and the Levi-Civita symbols in 
Chap. 9. 

If two coordinate systems are related by an orientation-preserving change of 
coordinates, then one of those coordinate systems is said to be positively oriented 
with respect to the other. In the three-dimensional space, the right-hand rule gives us 
an opportunity to characterize the orientation of a coordinate system in an absolute 
sense. A Cartesian coordinate system is positive if its coordinate basis vectors i, 
j. k are ordered according to the right-hand rule. Any other coordinate system is 
called positive in the absolute sense if it is positively oriented with respect a positive 
Cartesian coordinate system. 


4.11 Summary 

In this chapter, we discussed coordinate changes. Tensor calculus achieves invari- 
ance by constructing expressions where the eventual values remain unchanged 
under a change of coordinates. This is why the discussion of coordinate changes 
is critical. In the next chapter, we begin to introduce some of the most important 
objects in Euclidean spaces. In Chap. 6, we study how these objects transform 
under a change of coordinates. We find that some objects, characterized as tensors, 
transform according to a very special rule. Other objects (e.g., partial derivatives of 
tensors) do not transform according to the same rule. This lack of the tensor property 
poses a problem in constructing invariant expressions. This problem is addressed 
in Chap. 8 which introduces a new kind of tensor-property-preserving differential 
operator: the covariant derivative. 

In this chapter we also presented all the essential elements of the tensor notation. 
We saw how the indicial notation leads to very compact expressions that are easily 
interpreted. We also observed that indices naturally arise as lower indices and upper 
indices, and that the two types of indices seem to be in perfect balance. This is not 
a coincidence: the placement of the index helps distinguish between covariant and 
contravariant tensors. 


Chapter 5 

The Tensor Description of Euclidean Spaces 


5.1 Preview 

Armed with the tensor notation introduced in Chap. 4, we present a differential 
description of Euclidean spaces in tensor terms. The objects presented here are of 
great importance in the coordinate-based description of space and play a central role 
in the remainder of this book. 

All objects, except the position vector R. introduced in this chapter depend on 
the choice of coordinates. That is, if the same definition is applied in different 
coordinates, the resulting values are different. Such objects are called variants. 
Objects, such as the position vector R, the values of which are the same in 
all coordinates, are called invariants. For some variants, their values in different 
coordinate systems are related in a particular way. Such objects are called tensors. 
The discussion of the tensor property takes place in Chap. 6. 


5.2 The Position Vector R 

The position vector (or the radius vector ) is a vector R that represents the position of 
points in the Euclidean space with respect to an arbitrarily selected point O, known 
as the origin. The concept of the position vector is possible only in Euclidean spaces. 
After all, the position vector is straight, and straightness is a characteristic of the 
Euclidean space. If we were studying the surface of a sphere and were allowed to 
consider only the points on the surface and not in the surrounding space, we would 
not be able to introduce the position vector. 

The position vector R is introduced without the help of coordinates. Objects that 
can be constructed without a reference to a coordinate system are called geometric. 
The vector gradient of a scalar field is another example of a geometric object. 
Some of the objects we construct with the help of coordinate systems turn out 
to be independent of coordinates. Such objects are called invariant. Experience 
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shows that all invariant objects are geometric. As a result, the terms geometric and 
invariant are often used interchangeably. That is not to say that finding the geometric 
interpretation for an invariant object is always straightforward. 


5.3 The Position Vector as a Function of Coordinates 

Refer the Euclidean space to a coordinate system Z' . In the spirit of tensor calculus, 
the coordinate system Z' is arbitrary. In other words, we do not assume that 
it possesses any special characteristics. In particular, it is certainly not assumed 
Cartesian or affine. In the three-dimensional Euclidean space, the index i has values 
1, 2, 3 and Z' stands for Z 1 , Z 2 , Z 3 . When reading this chapter, it is probably best 
to keep the three-dimensional case in mind. Of course, all of the concepts are valid 
in any number of dimensions. 

In a Euclidean space referred to a coordinate system, the position vector R is a 
function of the coordinates Z'. That is, to each valid combination of Z 1 , Z 2 , and 
Z 3 , there corresponds a specific value of the position vector R. The coordinates 
uniquely determine the point in the Euclidean space which, in turn, corresponds to 
a particular value of the position vector R. Denote this function by R (Z): 

R = R(Z). (5.1) 

This notation is consistent with the convention described in Sect. 4.5 according to 
which we suppress the indices of the function arguments. 

The function R(Z) is the starting point in an important sequence of identities. 
It is therefore important to have a clear understanding of each of the elements of 
equation (5.1). This equation features the symbol R in two different roles. On the 
left-hand side, R represents the geometric position vector introduced in Sect. 5.2. On 
the right-hand side, R stands for the vector-valued function that yields the position 
vector for every valid combination of coordinates. As it is often the case, the same 
letter is used to represent two different objects. We distinguish among the objects 
denoted by the same letter by the full signature of the expression. In the case of 
equation (5.1), the signature includes the collection of arguments. Thus, R is the 
invariant position vector while R (Z) is a vector- valued function of three variables. 

The function R (Z) can be differentiated with respect to each of its arguments in 
the sense of Sect. 2.9. We reiterate that it is essential to treat R as a primary object 
that is subject to its own set of rules and operations. It is counterproductive to think 
of R as a triplet of components with respect to a some Cartesian grid. We have not 
and will not (unless it is convenient for a particular problem) introduce a background 
Cartesian coordinate system. Thus, there is no basis with respect to which R could 
be decomposed. Fortunately, we can perform quite a number of operations on vector 
quantities such as R, including differentiation with respect to a parameter. 


5.4 The Covariant Basis Z, 
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Exercise 60. Consider a curve parametrized by its arc length s, and consider the 
function R (s). Explain, on the basis of your geometric intuition, why T = R' (,v) 
is the unit tangent vector. Note that the condition |R' (s)\ = 1 may be taken as the 
definition of arc length s . 


5.4 The Covariant Basis Z, 


The covariant basis is obtained from the position vector R (Z) by differentiation 
with respect to each of the coordinates: 


_ 9R(Z) 
~ dZ< 


(5.2) 


The term covariant refers to the special way in which the vectors Z, transform under 
a change of coordinates. This is studied in Chap. 6. 

The covariant basis is a generalization of the affine coordinate basis to curvilinear 
coordinate systems. It is called the local coordinate basis since it varies from 
one point to another. It is sometimes described as the local basis imposed by the 
coordinate system. The letter Z is now used to denote an additional object. However, 
the symbols Z, and Z' cannot be mistaken for each other since the bold Z in Z, is a 
vector while the plain Z in Z' is a scalar. 

The tensor notation may still be quite new to you. Therefore, let us expand the 
relationships captured by equation (5.2). Since the index i is live, equation (5.2) 
defines three vectors Zi, Z 2 , and Z 3 : 


Z, = 


9R (Z 1 , Z 2 , Z 3 ) 
9Z 1 ’ 


Z, = 


9R (Z 1 , Z 2 , Z 3 ) 
9Z 2 ’ 


Z 3 = 


9R (Z 1 , Z 2 , Z 3 ) 


9Z 3 


(5.3) 


The argument-collapsing convention of Sect. 4.5 shortens these equation to 


= 9R(Z) . = 9R(Z) . 

9Z 1 ’ 2 9Z 2 


Z 3 


9R(Z) 

9Z 3 


(5.4) 


Finally, encoding this set of identities with the index i transforms them to the 
ultimate tensor form (5.2). 

Figure 5.1 shows the covariant basis Z, in the plane referred to Cartesian, affine, 
and polar coordinate systems. In affine coordinates, the covariant basis Z, coincides 
with the coordinate basis i, j. In curvilinear coordinates, the covariant basis varies 
from point to point. 

The covariant basis Z, is a primary object. Once again, you should not imagine a 
background Cartesian coordinate system and decompose the vectors Z, with respect 
to that system’s coordinate basis. It is counterproductive to think of Z, as a set of 
N 2 coordinates, where N is the dimension of the space. The vectors Z, are just that: 
vectors. 
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Fig. 5.1 The covariant basis Z, in Cartesian, affine, and polar coordinates in two dimensions 


At all points in the Euclidean space, the covariant basis Z, provides a convenient 
basis for decomposing other vectors. The components V of a vector V are the scalar 
values that produce Y when used in a linear combination with the vectors Z, : 

V = V% + V 2 Z 2 + V 3 Z 3 . (5.5) 


In the tensor notation 


V = V'Zj. (5.6) 

The values V are called the contravariant components of the vector V. The term 
contravariant refers to the way in which the object V transforms under a change of 
coordinates. This is discussed in Chap. 6. Since Z, varies from one point to another, 
two identical vectors U and V decomposed at different points of the Euclidean space 
may have different contravariant components. 

Exercise 61. What are the components of the vectors Z, with respect to the 
covariant basis Zj ? The answer can be captured with a single symbol introduced 
in Chap. 4. 


5.5 The Covariant Metric Tensor Z/y 

By definition, the covariant metric tensor Zjj consists of the pairwise dot products 
of the covariant basis vectors: 


Z ij =Z i -Z j . (5.7) 

The metric tensor is one of the central objects in tensor calculus. It is often referred 
to as the fundamental tensor. In linear algebra, the covariant metric tensor is 
known as the inner product matrix or the Gram matrix. Because the dot product 
is commutative (Z ,■ • Z ; = Z j • Z,- ), the metric tensor is symmetric: 


7 — 7 
^ ij ~ ^ ji • 


(5.8) 


5.6 The Contravariant Metric Tensor Z' 1 
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It carries complete information about the dot product and is therefore the main 
tool in measuring lengths, areas, and volumes. Suppose that two vectors U and V 
are located at the same point and that their components are U' and V . Then the dot 
product U • Y is given by 

U • V = Zy U' V j . (5.9) 

This can be demonstrated as follows: 

U • W = t/'Z, • V j Zj = (Z , • Z j) u‘ V j = Zjj u l v j . 

Exercise 62. Explain why the covariant basis, interpreted as a matrix, is positive 
definite. 

Exercise 63. Show that the length | V| of a vector Y is given by 

|v| = yj Zjj V’ vj . (5.io) 


5.6 The Contravariant Metric Tensor Z lJ 

The contravariant metric tensor Z' y is the matrix inverse of Zjj : 


Z i >Z jk =V k . (5.11) 

From linear algebra, we know that AB = I implies BA = I (the left inverse is the 
right inverse). Therefore, contracting on the other index also yields in the Kronecker 
symbol: 


Z i >Z kj =8i. (5.12) 

This relationship also follows from the symmetry of Zy , but the inverse argument 
is more general. 

Since the inverse of a symmetric matrix is symmetric (prove this), the contravari- 
ant metric tensor is symmetric 


Z ij = Z ji . 


(5.13) 


Exercise 64. Prove that Z' J is positive definite. 
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5.7 The Contravariant Basis Z' 


The contravariant basis Z' is defined as 


Z' = Z lJ Z r (5.14) 

Since Z ,J is symmetric, it makes no difference which index is contracted in 
equation (5.14). Thus we may also write 

Z' = Z Jl Zj. (5.15) 

The bases Z, and Z' are mutually orthonormal: 

Z'-Z j=8). (5.16) 

That is, each vector Z' is orthogonal for each Z Jt for which i^j. Furthermore, 
the dot product of Z' with Z, is 1. The latter relationship cannot be written as Z' • 
Z, = 1 since the repeated index would invoke the summation convention. Sometimes, 
uppercase indices are used to indicate a relationship for a single value of the repeated 
index: Z 1 -Z / = 1. 

Exercise 65. Demonstrate equation (5.16). Hint: Z' • Zj = Z' /f Z/ ; • Z j. 

Exercise 66. Show that the angle between Z 1 and Z\ is less than tt/2. 

Exercise 67. Show that the product of the lengths | Z 1 1 \L\ | is at least 1. 

Exercise 68. Show that the covariant basis Z, can be obtained from the contravari- 
ant basis ZJ by 


z, = Z, 7 Z'. (5.17) 

Hint: Multiply both sides of equation (5.14) by Z,-fc. 

Exercise 69. Show that the pairwise dot products of the contravariant basis ele- 
ments yields the contravariant metric tensor: 

Z'-Z j = Z ij . (5.18) 

Exercise 70. Argue from the linear algebra point of view that equation (5.16) 
uniquely determines the vectors Z' . 

Exercise 71. Therefore, equation (5.16) can be taken as the definition of the 
contravariant basis Z' and equation (5.18) can be taken as the definition of the 
contravariant basis Z lJ . Show that the property (5.11) follows these definitions. 


5.8 The Metric Tensor and Measuring Lengths 
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5.8 The Metric Tensor and Measuring Lengths 

The metric tensor plays a fundamental role in calculations. It is responsible for 
calculating lengths of vectors since |V| = -JZjj V' Vi . It is also the key to 
calculating lengths of curves. You are familiar with the following formula for the 
length L of a curve given in Cartesian coordinates by (x (t) ,y(t),z (t )): 



Note that this formula may also be taken as the definition of the length of a curve. 
(See Exercise 60 for an alternative equivalent approach.) What is the generalization 
of equation (5.19) to arbitrary coordinates Z ! in which the curve is given by 
(Z 1 ( t ) , Z 2 ( t ) , Z 3 (?)). The general formula for L is 



(5.20) 


Why is the formula (5.20) correct? In Cartesian coordinates, the metric tensor Z ;/ - is 
represented by the identity matrix. Therefore, since only the diagonal terms survive, 
equation (5.20) agrees with equation (5.19). So we are on the right track! The 
complete proof of equation (5.20) is found in Chap. 6. The key to the proof is the 
tensor property of all the elements in equation (5.20). 

Equation (5.20) can be interpreted in another fascinating way. Suppose you find 
yourself in a Euclidean space referred to some coordinate system Z'. Suppose 
also, that the only measurement you are allowed to take is the length of any curve 
segment. From equation (5.20), it follows that this would allow you to calculate the 
metric tensor at any point. Thus, lengths of curves can be calculated from the metric 
tensor and vice versa. 

Consider the point P = ( Z, 1 , , Z ( 2 . Z ( > ) and the curve given parametrically by 
(Zy + t, Zq, Z 3 ). This curve is the coordinate line corresponding to Z 1 that passes 
through P at t = 0. Denote the length of this curve from the point P by L\\ (t). 
Since only a single term survives under the square root in equation (5.20), 



(5.21) 


By the fundamental theorem of calculus, 


L' n (0) = ^Z^ at P. 


(5.22) 


We can similarly compute the remaining diagonal entries of Z (/ - . 
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To compute the off-diagonal terms, say Z\ 2 , consider the curve (Z ( | + t,Z\ 
+t, Zq) and denote its length from the point P by L \2 (t). This time, three terms 
survive under the square root in equation (5.20): 

L n (t)= [ y/Z u (h) + 2 Z 12 (h) + Z 22 (h)dh. (5.23) 

Jo 

Thus, 

L' u (0) = y/Z n + 2 Z 12 + Z 22 at P. (5.24) 

Since the diagonal terms Z\\ and Z 22 are already available, we are now able to 
compute Z 12 and, similarly, all the remaining entries of the metric tensor. 


5.9 Intrinsic Objects and Riemann Spaces 

Objects, such as the metric tensor, which can be obtained by measuring distances 
and computing the derivatives of those distances, are called intrinsic. Therefore, all 
objects that can be expressed in terms of the metric tensor and its derivatives are also 
intrinsic. Most notably, the Christoffel symbol presented below and the Riemann- 
Christoffel tensor presented in Chap. 8 are intrinsic. Intrinsic objects are particularly 
important in general relativity and in the study of embedded surfaces to which Parts 
II and III of this book are devoted. 

It turns out that much of differential geometry and tensor calculus can be 
constructed from the metric tensor as the starting point, bypassing the position 
vector R and the covariant basis Z, . Differential geometry that starts with the metric 
tensor is called Riemannian geometry. Its exclusive focus is on intrinsic objects. 
In Riemannian geometry, the dot product is defined in terms of the metric tensor, 
rather than the other way around. Thus, the metric tensor is a primary concept and 
equation (5.9) is the definition of the dot product. The properties of the space are 
dictated by the choice of the metric tensor. The metric tensor need not come from 
a Euclidean dot product, but can be rather arbitrarily assigned subject to certain 
conditions such as symmetry and positive definiteness. A coordinate space in which 
the metric tensor field is a priori given is called a Riemann space. The Riemannian 
perspective is critically important, even in Euclidean spaces. 


5.10 Decomposition with Respect to a Basis by Dot Product 

Linear algebra offers an elegant algorithm for calculating the components of a vector 
with respect to an orthogonal basis. Suppose that ei , . . . , is an orthogonal basis, 
that is 


5.10 Decomposition with Respect to a Basis by Dot Product 
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(e, ,e ; ) = 0 , if I ± j. 


(5.25) 


Let Vj be the components of a vector Y with respect to this basis: 

V= Vid + •■•+ V N e N . (5.26) 


Dot both sides of this equation with e, . Since all but one term on the right-hand side 
vanish by the orthogonality of the basis, we find 

(V, e,) = V, (e,, e,) . (5.27) 


Therefore, the coefficient V) is given by the simple expression 


(V. e, 
(e,,e,) 


(5.28) 


This expression simplifies further if the basis is orthonormal, that is, each basis 
vector is unit length: 


(e, , e, ) = 1. (5.29) 

With respect to an orthonormal basis, Vj is given by a single dot product 

K- = (V, e,) . (5.30) 

If the basis ei, . . . ,ejv is not orthogonal, it is still possible to calculate V; by 
evaluating dot products. However, the simplicity of the algorithm is lost. Instead 
of being able to determine each component Vj individually, one needs to solve an 
N x N system of coupled equations 


(ei,ei) • 

• (ei,ejv) 

' V\ ' 


~ (V, e,) _ 

_ (ev, eO • 

• (ejv, e^)_ 

_V N _ 


.(v, ejv )_ 


(5.31) 


Note that the matrix on the left-hand side corresponds to the metric tensor Zjj . 

In tensor calculus, this relatively complicated procedure is captured by a formula 
that is as simple as equation (5.30). Namely, suppose that V are the contravariant 
components of the vector V with respect to Z, . Then 

V‘ = V • Z' . (5.32) 

Note that V is dotted with the contravariant basis Z' . The proof of this relationship 
is simple 


Y-Z‘ = V j Zj ■ zi = v j si = v\ 


(5.33) 
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The formula in equation (5.32) is equivalent to equation (5.31) but has the simplicity 
of equation (5.30). Equation (5.32) states that V can be calculated by single dot 
product even if the basis is not orthogonal. Of course, the single dot product involves 
the contravariant vector T , the calculation of which involved the matrix inversion 
of the covariant metric tensor. Therefore, the algorithm implied by equation (5.32) is 
no simpler than that implied by equation (5.31). It is the same algorithm. However, 
its expression in tensor calculus is far more effective and concise. 


5.11 The Fundamental Elements in Various Coordinates 

This section catalogs the fundamental elements in the Euclidean space for the most 
common coordinate systems. In affine and polar coordinates, all elements can be 
calculated easily by geometric reasoning. Spherical coordinates pose a little bit more 
of a challenge. In order to calculate covariant basis Z, in spherical coordinates, 
a background Cartesian coordinate system is introduced. Earlier, we emphatically 
argued against background Cartesian coordinates. On the other hand, this section 
deals with a particular problem that can be solved more easily by choosing a 
convenient coordinate system. However, it is important to continue to think of the 
bases Z, and Z ' as primary objects that do not require another, more fundamental, 
coordinate system in order to be expressed. To reinforce this perspective, we 
illustrate the covariant basis Z, in Fig. 5.5 without the scaffolding of the background 
Cartesian coordinates. 


5.11.1 Cartesian Coordinates 

The covariant basis Z, coincides with the coordinate basis i,j,k. The coordinate 
basis arises naturally in affine (and therefore Cartesian) coordinates as a set of vec- 
tors that point along the coordinate axes and have the length of one corresponding 
coordinate unit. 

The geometric algorithm, by which the derivative Zi = 3R/3x is evaluated, is 
illustrated in Fig. 5.2. For any h, the vector 

V = + (5.34) 

h 

is the unit vector that points in the direction of the x-axis. Therefore, Y is the 
Cartesian basis element i. The limit of Y as h — > 0 is also i, and we conclude 
that the covariant basis Z, coincides with the coordinate basis: 

Z, = i; 


Z 2 = j: Z 3 = k. 


5.11 The Fundamental Elements in Various Coordinates 
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Fig. 5.2 Geometric 
construction of the basis 
vector Zi = 9R/3.V in 
Cartesian coordinates 



Consequently, the covariant and the contravariant metric tensors have only three 
nonzero entries: 


Z n = Z 22 = Z 33 = 1 (5.35) 

Z 11 = Z 22 = Z 33 = 1. (5.36) 


In the matrix notation, the metric tensors are presented by the identity matrix: 


Zij,Z iJ = 


1 0 0 
0 1 0 
0 0 1 


(5.37) 


The contravariant basis Z' consists of the same vectors as the covariant basis Z, : 


Z 1 = i: Z 2 = j: Z 3 = k. 


(5.38) 


5.11.2 Affine Coordinates 

A similar geometric argument shows that in affine coordinates, the covariant basis 
Z, coincides with the coordinate basis i.j.k. The metric tensor Z, ; is therefore 
unchanged from one point to another. Its entries are pairwise dot products of the 
coordinate basis vectors: 
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Fig. 5.3 In affine coordinates, the covariant basis Z, lines up perfectly with the coordinate grid. 
This figure also shows the contravariant basis Z' . Note the orthogonality of the vectors Zi = i and 
Z 1 , and Z? = j and Z 2 


i i i j 

j-i j j 

k i k j 


i k 
j k 
k k 


(5.39) 


The metric tensor is diagonal only if the affine coordinate system is orthogonal. 
The contravariant metric Z' 7 is obtained by inverting the matrix (5.39) and the 
contravariant basis is obtained by contracting Z' 7 with Z, . 

We illustrate this calculation with an example in two dimensions illustrated in 
Fig. 5.3. Suppose that the affine coordinates are such that the angle between Z\ = i 
and Z 2 = j is jt/3 and that |Zi | = 2 and IZ 2 1 = 1. Then 


ZiJ = 


4 1 
1 1 


and 


Z ij = 


1 

3 

1 


1 - 

3 

4 

3 - 


The contravariant basis vectors are therefore given by 


Z 1 = 

z 2 = 


1 . 

3 1 

1 

3 



(5.40) 


(5.41) 


(5.42) 

(5.43) 


Exercise 72. Confirm that Z 1 in equation (5.42) is orthogonal to Zi and that Z 2 in 
equation (5.43) is orthogonal to Z\. 

Exercise 73. Confirm that Z 1 • Z\ = Z 2 ■ Z 2 = 1. 

Exercise 74. Confirm that Z 1 • Z 2 = Z 12 = —1/3. 


5.11 The Fundamental Elements in Various Coordinates 
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Fig. 5.4 Geometric 
construction of the vector 



Zj = 3R/9r in polar 


coordinates 


5.11.3 Polar and Cylindrical Coordinates 

Consider polar coordinates in the plane. The geometric construction that leads to Z\ 
is illustrated in Fig. 5.4. It is clear that the vector 


R (r + h.9) — R (r, 0) 

h 


V = 


(5.44) 


points radially away from the origin O. Furthermore, the length of the vector AR = 
R (r + h , 0) — R (r, 9) in the denominator is h. Therefore, V = AR/ h is unit length 
for all h. We conclude that Zi is a unit vector that points in the radial direction. 

The vector Zi is more challenging, but we have already performed the calculation 
that gives Zo in Sect. 2.9 of Chap. 2. In that section, we showed that, for r = 1, Zi is 
the unit vector in the direction orthogonal to Z\ . It is also easy to see that the length 
of Zi is proportional to r by noticing that the denominator in 


R(r,0 + h) -R(r,0) 
h 


V = 


(5.45) 


is exactly twice as large at r = 2 as it is at r = 1 . This relationship also holds in 
the limit as /) — > 0 and we conclude that Zi is a vector of length r orthogonal to Z\ . 
The covariant basis for polar coordinates is seen in Fig. 5.1. 

A coordinate system with an orthogonal covariant basis is called orthogo- 
nal. Thus, polar coordinates are orthogonal. Orthogonal coordinate systems are 
characterized by diagonal metric tensors. The metric tensors for polar coordinates 
are given by 



1 0 

0 r~ 2 


(5.46) 
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Fig. 5.5 The covariant basis 
Z, at the point R = 2, 

9 = jt/4, (j> = 0 



The contravariant basis vector Z 1 equals Zi, while Z 2 is colinear with Zi and has 
length 1 /r. 

Exercise 75. Show that Z 3 in cylindrical coordinates is a unit vector that points in 
the direction of the z-axis. 

Exercise 76. Show that metric tensors in polar coordinates are given by 



'1 0 

O 


O 

O 

N 

II 

0 r 2 

0 

II 

N 

0 r~ 2 0 


1 

O 

O 

1 


1 

O 

O 
1 


Exercise 77. Show that Z 3 = Z 3 . 


5.11.4 Spherical Coordinates 

Align a background Cartesian grid with the spherical coordinates r. 9 , (p in the 
natural way: the z-axis coincide with the azimuthal axis 9 = 0 and let the x-axis 
coincide with the azimuthal axis <p = 0. The position vector R is given by 


R (r, 9,(f>) = r sin 9 cos (p i + r sin 9 sirup j + r cos 9 k. (5.48) 

The covariant basis is obtained by partial differentiation with respect to r, 9 , and <p: 

Zi = sin 9 cos </> i + sin 9 sin (p j + cos 9 k (5.49) 

Z 2 = r cos 9 cos cp i + r cos 9 sin0 j — r sin 9 k (5.50) 

Z 3 = — r sin 9 sin^> i + r sin 9 cos (p j. (5.51) 

The covariant basis is illustrated in Fig. 5.5. 


5.12 The Christoffel Symbol T,* 
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The metric tensors are given by 


"1 


'l 

r 2 

; z iJ = 

r- 2 

r 2 sin 2 0 _ 


1 

(N 

1 

_c 

CZ) 

rM 

1 

1 


The contravariant basis Z' is 

Z 1 = sin 0 cos (f> i + sin 0 sin 0 j + cos 0 k 

Z 2 = r _1 cos 0 cos <p i + r _1 cos 0 sin (f> j — r _1 sin 0 k 

Z 3 = — r -1 sin -1 0 sin0 i + r -1 sin -1 0 coscp j. 


(5.52) 


(5.53) 

(5.54) 

(5.55) 


5.12 The Christoffel Symbol T* 

In affine coordinates, the covariant basis Z, is the same at all points. In curvilinear 
coordinate systems, the basis varies from one point to another. The variation 
can be described by the partial derivatives 3Z ,/dZ J . The expression 3Z i/dZ^ 
represents N 2 vectors: each of the basis elements differentiated with respect to 
each of the coordinates. Each of the N 2 vectors 3Z//3Z 7 can be decomposed with 
respect to the covariant basis Z*. The resulting /V 3 coefficients form the Christoffel 
symbol Ty : 


3Z,- 
3 Zf 


T U Zk - 


(5.56) 


This equation represents N 2 vector identities that define /V 3 scalar entries of T^- . 

The Christoffel symbol hrst appeared in Elwin Bruno Christoffel’s master- 
piece [7]. This work was a precursor to Gregorio Ricci and Tullio Levi-Civita’s 
paper [34] that announced tensor calculus. The Christoffel symbol plays an extraor- 
dinarily important behind-the-scenes role. Its value becomes apparent in Chap. 8 in 
which the covariant derivative is introduced. 

The object 3Z,- /3Z y is the second derivative of the position vector R 


3Z, _ 3 2 R 

3 Yi ~ 3 Zi 3Z' ' 


(5.57) 


Therefore, 

3Z ; _ 3Z j 
dZJ ~ 3 Z 7 ’ 


(5.58) 
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Fig. 5.6 Elwin Bruno 
Christoffel (1829-1900), 
German physicist and 
mathematician, was a central 
figure in the origins of tensor 
calculus 



which implies that the Christoffel symbol is symmetric in the lower indices: 


1 ij ~ 1 ji' 

The explicit expression for the Christoffel symbol is 


r * = z k 


dz i 
~8ZJ' 


(5.59) 


(5.60) 


The equation follows immediately from the discussion in Sect. 5.10. Start with 
equation (5.56) and dot both sides with ZJ : 


Z 1 ■ 


dZj 

9Z7 


lyZfZ'. 


By equation (5.16), Z/, • Z. 1 = 8 l k , therefore 


(5.61) 


, dZj_ 

dZJ 


r M 


(5.62) 


Renaming the index / into k yields equation (5.60). 

In Euclidean spaces, equations (5.56) and (5.60) are equivalent. In Chap. 10, 
where we discuss differentiation on curved surfaces, we discover that equation (5.60) 
is a more universal approach to defining the Christoffel symbol than equation (5.56). 

The Christoffel symbol also appears in the decomposition of the partial 
derivatives 3Z ! /3 Z J of the contravariant basis Z'. Transform equation (5.60) 
by the product rule 


, _ 3 (Z k ■ Z,) 

y dZJ 


dZ k 

3Z7' Z '' 


(5.63) 


5.12 The Christoffel Symbol T,* 
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The product Z /: • Z, is the Kronecker symbol S k . Since 8 k has constant elements, its 
partial derivatives vanish. This yields a new expression for the Christoffel symbol 




from which it can be seen that 3Z k /8Z J is given by 


dZ k 

dZJ 


-r*z'. 


(5.64) 


(5.65) 


Note the similarities and the differences between equations (5.56) and (5.65). In 
both cases, the Christoffel symbol is the key to expressing the partial derivatives 
of the basis elements. On the other hand, the equations differ in sign — this sign 
difference between covariant and contravariant objects permeates all of tensor cal- 
culus. The second important difference is the roles of the indices. In equation (5.56), 
the two lower indices (/ and j) are free and the upper index participates in the 
contraction. In equation (5.65), the upper and lower indices ( k and j) are free and 
the other lower index is contracted. 

We now mention the remarkable property that the Christoffel symbol is intrinsic. 
The expression for T^ in terms of the metric tensor reads 


pi _ }_z k > n ( dZ m j _ dZjj \ 
ij 2 \ 3 Z> dZ‘ 3 Z m ) 


(5.66) 


In our approach, the Christoffel symbol was defined extrinsically in terms of 
the bases Z, and Z'. Therefore, the fact that it is actually an intrinsic object is 
surprising. In Riemannian geometry, where the starting point is the metric tensor, 
equation (5.66) serves as the definition of the Christoffel symbol. The derivation of 
equation (5.66) is given in the following exercises. 

Exercise 78. Show that 


= Zli T l jk + ZijT l ik . (5.67) 

The combination ZuTj k can be denoted by T,^- This is an example of index 
juggling introduced in the next chapter. Some texts refer to T, jk as the Christoffel 
symbol of the first kind and T'. fc as the Christoffel symbol of the second kind. In 
terms of the Christoffel symbol of the first kind, the expression for 3Z, y /3 Z k reads 


3fw 

3 Z k 


T/./a + r Uik . 


(5.68) 
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Exercise 79. By renaming indices, obtain an expression for each of the terms in 
parentheses of equation (5.66). 

Exercise 80. Combine all expressions in the preceding exercise to derive 
equation (5.66). 

Exercise 81. Consider a material particle moving along a curve with parametriza- 
tion with respect to time given by 


Z' = Z' (?) . (5.69) 

Suppose that the velocity of the material particle is given by V (?). Show that the 
component V ( ? ) of V (?) is given by 


V (?) = 


dZ i (?) 
dt 


(5.70) 


Hint: V(?) = R' (?). 

Exercise 82. Show that the component A' (?) of acceleration A (?) of the particle 
from the preceding exercise is given by 

dV‘ ■ , 

A‘ = — + T) k V J V k . (5.71) 

Exercise 83. Let B (?) = A' (?) be the rate of change in acceleration A (?). Show 
that B' (?) is given by 

dA' 

B‘ = — + T) k A J V k . (5.72) 

Exercise 84. For a general vector field U (?) defined along the curve Z' = Z 1 (?), 
show that 

u' (?) = + Wr ) k U k \ Z „ (5.73) 

Therefore, if U (?) is constant along the curve, we have 


dU‘ 


+ v j r jk u k 


= 0 . 


dt 


(5.74) 


5.13 The Order of Indices 
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5.13 The Order of Indices 


With the introduction of the Christoffel symbol, we have started using the tensor 
notation in earnest. We must appreciate the ease with which complex operations are 
expressed in tensor notation. Consider the now familiar change of identities 


, k 3Z, _ 3 ( Z k ■ Z,) 8Z k _ 3Z 
' 3 ZJ ~ dZJ dZJ ' ‘ dZJ ' 


In three dimensions, this equation represents 27 simultaneous calculations. Yet, the 
calculation flows as easily as if it were a single calculation. 

The Christoffel symbol has three indices, the largest number of indices we have 
encountered so far. The second Bianchi identity (12.15) in Chap. 12 includes an ob- 
ject with five free indices. It is impractical to treat these objects as multidimensional 
matrices with a specialized operation of multiplication. In tensor calculus, it is best 
to think of variants, regardless of dimension, as an indexed collection of values. In 
other words, a variant is an object that yields a value for every valid combination of 
indices. 

The order of indices is important. For example, what is the value of the 
Christoffel symbol in spherical coordinates at the point r = 1, 6 = jt/ 2, </> = 0 for 
the index values 1,3, 2? This question is unclear because it is not indicated which 
index is meant to have which value. It is often impractical to refer to indices by name 
since indices can be renamed. A better way to eliminate the ambiguity is to agree 
on the order of the indices. For the Christoffel symbol T k j, there are two competing 
conventions regarding the order of the indices. According to the convention adopted 
here, the upper index is first and the two lower indices are second and third. 

For variants with indices that are all lower or all upper, the order is self-evident. 
Such variants include the metric tensors Z, 7 and Z ,] . For some variants, where the 
index placements are mixed, the order does not matter. For example, the Kronecker 
symbol 8‘- is zero for index values 1 and 2 regardless of which index has value 1 
and which has value 2. For all other variants the order is important and must be 
maintained either by convention, as in the case of the Christoffel symbol, or by 
a special notational device such as the dot place keeper. For example, in order to 
indicate that the upper index is first for a variant A l - we can write it as A[ - . If we 
want the upper index to be second, we write it as A'j . The dot approach is discussed 
further in Chap. 6 in the context of index juggling. 
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5.14 The Christoffel Symbol in Various Coordinates 
5.14.1 Cartesian and Affine Coordinates 

In affine coordinates, and Cartesian in particular, the Christoffel symbol vanishes at 
all points 


"i 

II 

O 

(5.76) 

This is a differential way of saying that the covariant basis, and therefore the metric 
tensors, are unchanged from one point to another. 

5.14.2 Cylindrical Coordinates 


The nonzero Christoffel elements are given by 


r 22 = -r 

(5.77) 

p 2 p2 ^ 

1 12 - 1 21 - r - 

(5.78) 

5.14.3 Spherical Coordinates 


Letting R = 1, @ = 2, and <t> = 3, the nonzero 
given by 

Christoffel symbol entries are 

r©e = ~ r 

(5.79) 

r|<j, = —r sin 2 6 

(5.80) 

p0 p 0 ^ 

1 R@ ~ 1 &R ~ 

(5.81) 

r®^ = — sin 0 cos 0 

(5.82) 

p p <t> ^ 

1 Ri> — 1 — ~ 

(5.83) 

rf* = rfe = cote. 

(5.84) 


Exercise 85. Derive the Christoffel symbols in cylindrical and spherical coordi- 
nates by equation (5.60). 


5.15 Summary 
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Exercise 86. Derive the Christoffel symbols in cylindrical and spherical coordi- 
nates by equation (5.66). 


5.15 Summary 

In this chapter, we introduced the most important differential elements in the 
Euclidean space: the position vector R, the covariant and contravariant bases Z ; - 
and Z\ the covariant and contravariant metric tensors Z ;J and Z' 7 , and the volume 
element \[Z. These objects are collectively known as the metrics. There are two 
more objects that are considered metrics: the volume element \[Z and the Levi- 
Civita symbols e ,jk are discussed. Both are introduced in Chap. 9. The volume 
element plays a particularly crucial role in the subject of integrals (Chap. 14). 

In the next chapter we study how variants transform under a change of coor- 
dinates. We learn that the metrics and a number of other objects are tensors (that 
is objects that transform according to a special rule) and discuss the important 
implications of the tensor property. 


Chapter 6 

The Tensor Property 


6.1 Preview 

In this chapter we get down to the business of tensors. We introduce the concept 
of a variant and define tensors as variants that transform from one coordinate 
system to another by a very special rule. We define covariance and contravariance 
and demonstrate how these opposite modes of transformation under a change of 
coordinates lead to the main goal of tensor calculus: invariance. 


6.2 Variants 

A variant is an object that can be constructed by a similar rule in various coordinate 
systems. In the preceding chapter, we encountered a number of variants. For 
example, the covariant basis Z, is a variant because it is obtained in any coordinate 
system by the same rule — partial differentiation of the position vector R. The metric 
tensor Z , 7 is also a variant because it is formed by pairwise dot products of the 
elements of the covariant basis. Similarly, every object introduced in the preceding 
chapter is a variant, including the Christoffel symbol Vf- . 

The Jacobian jf is not a variant. It expresses a relationship between two 
coordinate systems and therefore cannot be constructed given one coordinate 
system. Similarly, second-order derivatives Jjj are not variants. If one of the 
coordinate systems is fixed and the other one is allowed to change, then J- and 
J‘' can be considered variants with respect to the changing coordinate system. 

Naturally, when the same rule is applied in different coordinate systems, the 
results differ. This is the motivation behind calling these objects variants. Among 
variants, there stands out a very special class of objects that transform according to a 
special rule. These objects are called tensors, and we now turn to the study of these 
crucially important objects. We note in advance that all metrics Z,-, Z' , Z, y and Z' ; 
are tensors. 
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6 The Tensor Property 


6.3 Definitions and Essential Ideas 
6.3.1 Tensors of Order One 

A variant T, is called a covariant tensor if its values 7) and f> in the coordinate 
systems Z' and Z' are related by 


T' = Tjji,, ( 6 . 1 ) 

where J l , is the Jacobian defined in equation (4.70). A variant T' is called a 
contravariant tensor if T' and T 1 ' are related by the Jacobian jf defined in (4.71) 

T 1 ' = T l jf. (6.2) 

The term covariant means transforms in the same way as the basis 7u\ and the term 
contravariant means transforms in the opposite way. Recall that the Jacobians Jf 
and J ■ are the matrix inverses of each other. Covariant tensors are denoted by 
lower indices and contravariant tensors are denoted by upper indices. In addition to 
numerous other advantages, the placement of the index makes it easy to remember 
whether Jf or Jf is to be used. 

The fundamental importance of the tensor property is rooted in the fact that 
Jf and Ji are the matrix inverses of each other. This relationship is expressed 
in equations (4.74) and (4.76). Let us repeat these identities here since we will make 
frequent use of them in this chapter: 

JfJ- = 8) (4.74) 

Jf J), = 8‘j,. (4.76) 


6.3.2 Tensors Are the Key to Invariance 

To appreciate the importance of the tensor property, let us prove a special case of the 
contraction theorem. It is the fundamental theorem of tensor calculus and it gives 
us a recipe for producing geometrically meaningful invariants. Suppose that .S’, is a 
covariant tensor and T' is a contravariant tensor. Then the contraction 

U = SjT (6.3) 

is invariant. That is, U evaluates to the same value in all coordinate systems! 

To demonstrate this property, consider the quantity U' evaluated in the primed 
coordinates according to the same contraction as U in the unprimed coordinates: 


U' = Si'T' . 


(6.4) 


6.3 Definitions and Essential Ideas 
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Relate the values S,/ and T 1 ' to their values .S', and T' in the unprimed coordinates: 

U' = SiJj,T k Jl'. (6.5) 

We introduced a new index k in expressing T 1 ' since i was used for 5’,/. Note, that 
because of the relationship (4.74) between the Jacobians J l , and J' k , U' is given by 

U' = SiT k &[, (6.6) 


or, since T k S‘ k = T‘ , we find 


In other words. 


u' = ,s’,r. 


u' = u 


(6.7) 


(6.8) 


and we have shown that the variant U evaluates to the same value in all coordinate 
systems. The keys to the invariance are the tensor property of Sj and T' and the 
inverse relationship between the Jacobians. 


6.3.3 The Tensor Property ofZj 

We now show that the covariant basis Z, is indeed a covariant tensor and so the term 
covariant is justified. The demonstration of the tensor property usually proceeds as 
follows. We start with the definition of the variant in the primed coordinate system 
and then relate its values in the primed coordinates to its values in the unprimed 
coordinates. If the relationship is of type (6.1) or (6.2) then the variant is a tensor of 
the corresponding kind. 

Let us first look at the one-line proof of the tensor property Z, and then discuss 
each step: 


Z i' = 


3R(Z') 3R(Z(Z')) 


dZ' r 


3Z'' 


3R 3Z' 

dzJdzJ' 


Z ,■//,. 


(6.9) 


Now, one step at a time. The covariant basis Z,/ in the primed coordinate system is 
given by the partial derivative of the position vector R with respect to the primed 
coordinate Z' : 


Z,v = 


3R(Z') 

3Z'' 


(6.10) 
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The function R ( Z') gives the position vector R as a function of the coordinates 
Z' . The following is the critical step from which the ultimate relationship is 
derived. The function R (Z r ) can also be constructed by substituting Z' ( Z' ) — the 
expression for the unprimed coordinates in terms of the primed coordinates — into 
R(Z), that is the function that gives the position vector R as a function of the 
unprimed coordinates Z’ : 


R(Z') = R(Z(Z')) . (6.11) 

The next step is differentiation by the chain rule, the tensor form of which you 
mastered in Chap. 4: 


9R(Z(Z')) 9R dZ i 


(6.12) 


9Z'' dZ i 9Z'' ' 

Finally, we recognize that 9R/9Z' is Z; and 9Z'/9Z' is J',. Summarizing 

Z i ' = Z i jf„ (6.13) 

which shows that Z, is indeed a covariant tensor. 


6.3.4 The Reverse Tensor Relationship 

Suppose that 7} is a covariant tensor, that is, 7}/ = 7J What is T, in terms of T/l 
For a moment, consider this question in the language of linear algebra: if T r = JT, 
then what is T in terms of T'l It is obtained by left-multiplying both sides of the 
identity by / -1 , yielding T = J 1 T' . In tensor algebra terms, the same effect is 
achieved by multiplying both sides of the identity 7)/ = 7} J'., by /j : 

Ti'J'j = T{jj,Jj . (6.14) 

Since J‘, Jj ' = S'j by equation (4.74), we have 

Ti'J'j' = TjS'j =Tj. (6.15) 

Finally, renaming the index j to i , we find 

T = Ti'jf. (6.16) 

Similarly, if T' is the contravariant tensor, then the reverse of T 1 ' = T‘ J! reads 


6.3 Definitions and Essential Ideas 
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Equations (6.16), (6.17), in conjunction with (6.1), (6.2), offer greater flexibility 
in demonstrating the tensor property. On some occasions, it is easier to start with 
the unprimed components 7} or T' and relate them to the primed components 
7jv and T‘ . On other occasions, its more natural to go the other way. Equa- 
tions (6.1), (6.2), (6.16) and (6.17) help move effortlessly in either direction. 

There is a noteworthy elegance in equations (6.1), (6.2), (6.16), (6.17) which is 
found in all tensor relationships: the placement of the indices leads to the proper 
choice of objects. For example, you may not find it easy to remember whether 
T' and T' are related by ./ ', or its inverse J\ . The matching of indices naturally 
suggests the correct equations (6.2) and (6.17). This property of the tensor notation 
is one of its greatest assets. 


6.3.5 Tensor Property of Vector Components 

We next show that components V of a vector Y with respect to the covariant basis 
Zj form a contravariant tensor. As always, begin by writing the definition of V in 
the primed coordinates: 

\=V i 'z ,■/. (6.18) 

By the tensor property of Z, , we have 

V=V i 'jj,Z i . (6.19) 

In the unprimed coordinates, the vector Y is given by 

V = K'Z,. (6.20) 

Since decomposition with respect to a basis is unique, the last two equations yield 
the following relationship between V and V : 

V = V 1 ' J\,. (6.21) 

This proves the contravariant property of vector components V . 

We have previously shown that if S, and T l are a covariant tensor and a 
contravariant tensor, then the contraction 5, T‘ is an invariant. In the foregoing 
proof, we have essentially demonstrated a converse: If S, T' is invariant and one 
of the terms in the product is a tensor, then the other term is also a tensor. This is a 
special case of a more general quotient theorem. 
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6.3. 6 The Tensor Property of Z 1 

The tensor property of the contravariant basis Z' can be proved by similar reasoning. 
Recall the following relationship between the covariant and contravariant bases: 

Z'-Z j=8). (5.16) 

Multiply both sides by J-' Jj, 

ZJjf -Zjjj, = (6.22) 

As we stressed in Chap. 4, the order of the multiplicative terms is immaterial. 
Furthermore, the placement of the dot ■ in the dot product is arbitrary, and the result 
is not affected by the order of operations. 

By the tensor property of Z . , the contraction Z , J J , is Z . Furthermore <$' J‘ J 

J J J J J l J 

is 8j, (why?), yielding: 

Z‘jf ■ Z r = 8‘y. (6.23) 

This identity tells us that the set of three vectors Z' J! is mutually orthogonal to 
Zj*. Therefore, on linear algebra grounds, the vector Z' jf must be Z 1 ': 

ZJ' = ZJj;'. (6.24) 

This proves the contravariant property of Z' . 


6.4 Tensors of Higher Order 

The definition of tensor generalizes to an arbitrary number of indices. For example, 
a variant 7 )j is a doubly covariant tensor if its primed values 7}/ ■, relate to the 
unprimed 7 ;/ - by the identity 


Ti'V = TjJi'Jr (6-25) 

Similarly, 7' J is a doubly contravariant tensor if T’' J ' and T IJ are related by 

T'i' = T ij jfjf. (6.26) 


Finally, T‘ is tensor of one contravariant order and one covariant order if 

V' = 7 Pfjj,. 


(6.27) 


6.4 Tensors of Higher Order 
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When stating that a particular variant is a tensor, one need not mention the flavor 
(covariant or contravariant) of the index since it is indicated by the placement of the 
index. Therefore, 7y , T ' J , and Tj are all described simply as second-order tensors. 

The tensor definition is easily extended to tensors of any order by operating on 
each index with the proper Jacobian. For instance Rijki is a fourth-order tensor if 

Ri'j'k'V = Rijki 4 J f 4 4 • (6.28) 


6.4.1 The Tensor Property of Zjj and Z l * 

We now show that Zy and Z' y are tensors. For the covariant metric tensor Zy, 
its components in the primed coordinates are, by definition, the dot product of the 
primed covariant basis vectors: 


Zfj' = Zy • Zj'. (6.29) 

Utilizing the covariant property of the covariant basis, we have 

Zi'j' = ZijL-Zjjf. (6.30) 

Recognizing that Z, • Z , = Zy we arrive at the identity 

Zi r j ' - z ;/ (6.31) 

which proves the tensor property of Zy . 

The tensor property of Z' y can be established in two ways. Since we have already 
established the tensor property of Z' , the tensor property of Z' y follows easily by 
analogy with Zy : 

Z'' y ' = Z'' • TJ' = Ti jf ■ Z y jf = Z ij jfjf. (6.32) 

1 J 1 J 

On the other hand, the tensor property of Z' y can be established by a quotient 
argument similar to the argument that showed the tensor property of Z' . 

Exercise 87. Prove the tensor property of Z' y from the identity z ij z jk = Si 


6.4.2 The Tensor Property of 8\ 

The Kronecker symbol S'- may seem like an invariant: it has the same values in all 
coordinates. However, the concept of invariant only applies to variants of order zero. 
The Kronecker symbol as a variant of order two. But is it a tensor? The answer is 
yes. Below, we prove that a product (or a dot product) of two tensors is a tensor. 
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This implies that S'j is a tensor because it is the dot product of two tensors: 5‘- = 
Z ! • Z j . On the other hand, we can show the tensor property of the Kronecker symbol 
directly. Multiply 8'j by the combination J-' Jj,. The result is 8'y 

S'j jf jj, = = V-,, (6.33) 

which shows that the values of the Kronecker symbol in different coordinate systems 
relate by the tensor rule. Therefore, the Kronecker symbol is a tensor. 


6.5 Exercises 


Exercise 88. Show that, for a scalar field F , the collection of partial derivatives 
8F/dZ' is a covariant tensor. 

Exercise 89. Given a scalar field F, show that the collection of second-order partial 
derivatives d 2 F/dZ' 3 Z J is not a tensor. More generally, generally show that, for a 
covariant tensor field 7), the variant 97/ /3Z J is not a tensor. 

Exercise 90. Show that the skew-symmetric part 5y 


, _ 97/ 97} 
,y dZJ 3Z 1 ' 


(6.34) 


of the variant 97/ /9 Z J is a tensor. 

Exercise 91. Similarly, for a contravariant tensor T ‘ , derive the transformation rule 
for 97’’ /3Z- 7 , and show that it is not a tensor. 

Exercise 92. Derive the transformation rule for the Christoffel symbol T^ on the 
basis of equation (5.60) and show that is not a tensor. 

Exercise 93. For a tensor 7/ y - , derive the transformation rule for 97y / 3 Z k and show 
that it is not a tensor. Apply the obtained relationship to the metric tensor Z/ ; - and 
use the result to obtain the transformation rule for the Christoffel symbol on the 
basis of equation (5.66). 

Exercise 94. Show that if a tensor vanishes in one coordinate system, then it 
vanishes in all coordinate systems. 

The tensor property may sometimes be considered with respect to a subclass of 
transformations. That is, a variant may not be a tensor in the general sense, but still a 
tensor with respect to a narrower class of coordinate transformations, such as linear 
transformations characterized by Jacobians with positive determinants. 


6.6 The Fundamental Properties of Tensors 
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Exercise 95. Which of the objects considered in the preceding exercises are tensors 
with respect to linear transformations 

Z'' = A i /z i + b 1 ', (6.35) 

without being tensors in the general sense? 


6.6 The Fundamental Properties of Tensors 
6.6.1 Sum of Tensors 

The sum of two tensors is a tensor. It is implied that the two tensors being added 
have identical index signatures and the result is a tensor with the same signature. 
We prove this property by considering a representative indicial signature. Let A‘- k 
be the sum of B‘- k and C‘ k : 


A) k = B) k+ C) k . 


(6.36) 


Then 


A 1 


j'k' 


B’j, k , + C'j, k , 


B) k Ji 44 + C) k Jt JU, 


J k’ 


(6.37) 


Summarizing, 





(6.38) 


which proves that A'- k is a tensor. 

Exercise 96. Show that a linear combination of tensors is a tensor. 


6.6.2 Product of Tensors 

Any two variants can be multiplied together to form a new variant. The result is 
called the tensor product and it has as many indices as the multiplicative terms 
combined. For example, 


A 


jkl 


Bj C k i ■ 


(6.39) 
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The tensor product of tensors is a tensor. The proof for representative indicial 
signatures is completely straightforward: 


*\>k>v = = B)c kl j[H44 = 4ki44' J k'4- 


rk rl 


(6.40) 


Similarly, the dot product of two vector- valued tensors is a tensor, and the product 
of a scalar- valued tensor with a vector- valued tensor is a tensor. 


6.6.3 The Contraction Theorem 

We have already discussed a special case of the contraction theorem and learned 
that it is the key to forming invariants. Here we give a general form of the theorem 
and discuss its important implications. The theorem states: contraction of a tensor 
is a tensor. Of course, a contraction is only valid when it is over one covariant and 
one contravariant index. 

The proof once again proceeds by example. The key to the proof is, of course, the 
inverse relationships between the Jacobians. Suppose that S l - k is a tensor of order 
three. We will show that T k = S' jk is a tensor. We first express the fact that S' jk is a 
tensor 


S )'k' 


Sj k jf Jy J k ' ■ 


Contracting both sides on V and j' yields 


i'V 


C< fj jk 
° jk J i 


Since Jf J-, = <5/ , we find 


or 


S\, k i — Sj k S{ J k ,, 

S\ r k' = 


In other words. 


Tv = T k J k k , 


(6.41) 

(6.42) 

(6.43) 

(6.44) 

(6.45) 


and the desired relationship is obtained. This demonstration for a particular index 
signature should convince you that the theorem holds for arbitrary tensors. 



6.7 Exercises 
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6.6.4 The Important Implications of the Contraction Theorem 

We have shown that a variant built up from tensors is a tensor. Building up 
an expression includes forming linear combinations, products and contractions. 
Importantly, in this scheme, invariants can be treated as tensors of order zero. For 
example, if U is an invariant, then by definition 

U' = U. (6.46) 


which is consistent with the definition of a tensor with zero indices. 

Therefore, by the contraction theorem, when all indices are contracted away, the 
result is an invariant. For example, if Sj is a tensor, then S‘ is an invariant. If Sjj is 
a tensor, then Sf is an invariant and .Sjj is an invariant as well, but a different one. 
If Sj is a tensor and Tf is a tensor, then Sj 7’ ; ,/ is an invariant by combinations of 
the product and contraction theorems. If Sj is a tensor, then (sf8 l j — Sj S k j SjS/ is 
an invariant by combination of the sum, product, and contraction theorems. 

And so, we developed a strategy for forming invariants: stick tensors and tensor 
operations — linear combinations, products, and contractions. This is the central 
idea of tensor calculus and an important moment in this book. Earlier we argued 
that while coordinate systems are an irreplaceable tool for solving problem, they 
often lead to loss of geometric insight and unmanageable calculations. We have just 
found a remedy to many of the coordinate ills! The tensor framework provides an 
approach in which calculations can take full advantage of coordinate systems while 
maintaining the geometric insight. 

A tensor is a beautiful object. It is not an invariant and, in a particular coordinate 
system, it exhibits some features that reflect its geometric origins and some 
features that are strictly artifacts of the coordinate system. Nevertheless, the 
artifacts of the coordinate system are present in a systematic tensor fashion and 
are removed by contraction, leaving us with a geometric object. This is what makes 
tensors meaningful. 

Many a poetic analogy can be made for tensors. One is that of a photographic 
negative. A negative is not a photograph but it has all the information that is needed 
to make one. The equipment and the materials required to make a photograph 
represent the contraction in the analogy. Importantly, while a negative is not a 
photograph, it is actually better than a photograph for a number of photograph- 
making tasks. For example, a negative can be used to make a photograph of many 
reasonable sizes without loss of quality. A negative is not meaningful by itself 
(neither is a tensor), but in combination with proper equipment (a tensor needs 
contraction) it can produce a photograph (tensors lead to invariants). 


6.7 Exercises 


Exercise 97. Suppose that S, and T jk are tensors. Prove that 5, T‘ J is a tensor. 
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Exercise 98. Since S', is a tensor, S' must be an invariant. What is the value of S' 
and what is its geometric meaning? 

Exercise 99. Suppose that V, ; is a tensor with vector elements. Show that Vjj, the 
components V,y with respect to the covariant basis Z, , is a tensor. 

Exercise 100. Give a variant arbitrary values in some coordinate system and let 
it transform by the proper tensor rules. Show that this construction produces a 
tensor. More specifically, single out a particular coordinate system Z' and choose 
an arbitrary collection of values for T'- J . Then, in coordinates Z', define T k by 



(6.47) 


Show that T k is a tensor. 


6.8 The Gradient Revisited and Fixed 

We now have all the tools to address the issue of the gradient that was raised 
earlier. Recall that the gradient V F of a scalar field F has a perfectly clear invariant 
definition: it is a vector that points in the direction of the greatest increase of F and 
its magnitude equals the directional derivative of F in that direction. 

The problem arose when we attempted to find an analytical expression for the 
gradient that yielded the correct result in all coordinate systems. The candidate 
expression was the combination 



(6.48) 


Note that in this expression we must use the summation sign because both 
indices are lower and the summation convention does not apply. We learned that 
this expression gives the right answer only in Cartesian coordinates. The tensor 
perspective offers both the explanation of why expression (6.48) fails and a fix. 

Equation (6.48) has several elements, of which all but one are proper from the 
tensor points of view. The collection of partial derivatives 8F/dZ ' is a tensor. The 
covariant tensor Z, is a tensor. Their product (3F/3Z- 7 ) Z, is a tensor of order two. 
However, the contraction is invalid because both tensors are covariant. The result, 
therefore, is not guaranteed to be an invariant. In fact, it is virtually guaranteed 
not to. 

The fix dictates itself. In order to produce an invariant we must mix covariant and 
contravariant tensors. The correct analytical expression for the gradient is 



(6.49) 


6.9 The Directional Derivative Identity 
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By the product and contraction theorems, this expression is an invariant. 
Furthermore, in Cartesian coordinates it reduces to 


„ dF 3 F 3 F 
VE = — ei + — e 2 + — e 3 , 
dx dy dz 


(6.50) 


which gives the correct vector. Since the combination in (6.49) gives the same 
answer in all coordinate systems and the correct answer in Cartesian coordinates, 
it must give the correct answer in all coordinates! 

This way of reasoning, which resorts to special coordinate systems, is frequently 
used in tensor calculus. It is often preferable and more geometrically insightful to 
furnish a proof that does not require particular coordinate systems. 


Exercise 101. Explain why the Laplacian AF of a scalar function F cannot be 
defined as 


AF = Z' 


3 2 F 

3Z'3Z? 


(6.51) 


Hint: Is 3 2 f 7 /3Z'3Z 7 a tensor? 


6.9 The Directional Derivative Identity 


Now that we have a solid analytical definition of the gradient, we are able to give 
the proper analytical derivation of equation (2.11) 

dF 

— = VE-L. (6.52) 

dl 

Suppose 1 is a straight line passing through the point P in the direction of the unit 
vector L. Suppose that 1 is parametrized by its arc length s: 

Z ' = Z' (s ) . (6.53) 

Then, along /, the function F (s) is given by the composition 


F (s) = F (Z ( s )) . 
Differentiating this identity by the chain rule yields 

dF _ dF dZ‘ 
ds 3 Z 1 ds 


(6.54) 


(6.55) 


The three elements of this equation can be identified with the three elements in 
equation (2.11). dF / ds is by definition the directional derivative dF/dl , dF/dZ 1 
is the gradient V,- F , and, finally, dZ' /ds represents the unit tangent L. Therefore, 
equation (2.1 1) is justified. 
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6.10 Index Juggling 

We have already encountered index juggling in Chap. 5 where we introduced the 
contravariant basis Z' as Z' ; Z y . This is an example of raising the index. In general, 
the raising of the index is the contraction of a variant with the contravariant metric 
tensor: 


T ] = fZ' 1 . 


(6.56) 


The variant T J is closely related to the variant 7}. In fact, although the elements of 
T 1 and 7} are different, they can be thought of as different manifestations of the 
same object. 

Contraction with the covariant metric tensor effects the lowering of the index. 
Not surprisingly, the two operations are the inverses of each other, so no information 
is lost or created by the operation. It is simply a notational convenience, but as far 
as notation conveniences go, it is on par with the Einstein summation convention in 
terms of effectiveness. The operations of lowering and raising indices are referred 
to collectively as index juggling. 

Any covariant index can be raised and any contravariant index can be lowered. 
Recall that in every tensor, the order of the indices must be clearly specified. When 
all the indices are either covariant, as in Tjjk, or contravariant, as in T ^ k , the natural 
order of the indices is easy to recognize: for 7 )jk and T ljk , i is the first index, j is 
the second, and A: is the third. 

When the flavors of the indices are mixed, as in 77 , the order of the indices is 
more arbitrary. For example, we may agree that in 77 , the upper index is first and the 
lower index is second. However, once the order is established, it must be maintained 
in subsequent computations. If indices can be juggled, a notational device is need 
to maintain the order of the indices. To illustrate the potential problem, consider the 
situations in which the upper index of 77 is lowered and the lower is raised: 


Tf = Tj Zj i Z jk . 


(6.57) 


Then what is the entry Tj ? Well, it is unclear whether 7 2 * is referring to the original 
variant or the one with the indices juggled. 

The solution is to allocate a slot for each index and use the dot • to spread out the 
indices. The original variant can be denoted by T‘. clearly indicating that the upper 
index is hrst and the lower is second. The upper part of the second slot occupied by 
j could also be marked with a dot T‘j, but the second dot is not necessary. If the 
hrst index is lowered, dots are not necessary since both indices are lower: 



(6.58) 
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Subsequently, if the second index is raised, the dot reappears in the first slot: 

T k l = T kj Z> 1 . (6.59) 

With this notation it is clear that T\ refers to the original form of the variant, T\i 
refers to the intermediate form, while 7j' 2 refers to the juggled form. 

There are several types of variants for which tracking the order of indices is 
handled by different means. First, are symmetric variants. If 


T tJ = Tji (6.60) 

then 

T'j = Tj' , (6.61) 

and it can therefore be written as Tj without confusion. Note, as discussed in the 
following section, the Kronecker symbol Sj with a lowered index is Z tJ . Thus, 
8]j = Sj and it can be denoted by Sj with no ambiguity. 

Finally, there are a number of special variants for which the order of indices 
is defined by convention example, for the Christoffel symbol Vj k , introduced in 
Chap. 8, we adopt the convention that the upper index is first. In practice, there does 
not arise a situation where the lower indices need to be raised. On the other hand, 
the first Christoffel index is commonly juggled, and the Christoffel index with the 
lowered first index is written as T, ^. Note that even the comma is unnecessary. 
However, some texts use the convention where the upper index is actually last. The 
comma therefore is meant to signal a casual reader of this book as to our convention. 
If the situation ever arises where second and third indices are raised, the • notation 
would be advisable. 

Exercise 102. Consider a coordinate system where the metric tensor at point A is 


ZiJ = 


2 -1 
-1 2 


(6.62) 


Suppose that T j kl is a object whose only nonzero value is 

^■212 = 7t - 


(6.63) 


Determine 7j 2 i2 and T 12 1 2 

Exercise 103. Show that the operations of raising and lowering indices are the 
inverses of each other. That is, if 


Si = S J Z jk , 


(6.64) 
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then 


S ' = SjZ jk . (6.65) 

Exercise 104. Show that the flavors of indices can be swapped in a contraction: 

S i T i =S i T i . ( 6 . 66 ) 

Exercise 105. Show that in a tensor identity, a live covariant (contravariant) index 
can be raised (lowered) on both sides. That is, if, for instance, 

S' = T jki U ]k , (6.67) 

then 

Si = T^U jk . ( 6 . 68 ) 

Exercise 106. Show that for symmetric tensors 7y ( T, j = 7},), we have 

T'. = T- j . (6.69) 

Therefore 

8*j=8'j, (6.70) 


as we are about to show. 


6.11 The Equivalence of 8 ‘. and Z (/ - 


Objects related by index juggling are considered equivalent. From this point of 
view, the Kronecker symbol and the metric tensor are equivalent, since 8‘- with the 
contravariant index is the covariant metric tensor: 

S'jZjk = Zjk- (6.71) 

Therefore, the object that could be denoted by 8jk is actually Zjk- Similarly, what 
is Zjj with the index raised? It is the Kronecker symbol, since 

ZijZ jk = 8 k . (6.72) 

Finally, raising the covariant index in 8'j yields the contravariant metric tensor: 

8)Z jk = Z ik . 


(6.73) 



6.13 Summary 
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Thus, Zjj, 8‘j, and Z IJ represent one and the same family of tensors related by 
index juggling. In fact, many texts denote Z, y by <5, ; - and Z' J by <5' 7 . Furthermore, 
some texts denote the Kronecker symbol by Z' . Since the Kronecker delta symbol 
is a form of the metric tensor, each of these choices are perfectly valid. On the 
other hand, will continue to denote the Kronecker delta by <51 and covariant and 
contravariant metric tensors by Z, ; and Z lJ . 


6.12 The Effect of Index Juggling on the Tensor Notation 

Index juggling leads to further compacting of the tensor notation. As a result of 
index juggling, the metric tensor — perhaps the central object in tensor calculus — 
almost never appears in tensor relationships explicitly. Instead, the contraction of the 
metric tensor with other variants is reflected in the location of the index. Consider, 
for instance, the dot product of two vectors U = U‘ Z, and V = V 1 Zj : 

U • Y = U'Zj ■ V j Zj = Z u U ‘ V j . (6.74) 

With index juggling, U • Y can be written as 

U • V = I/; V' , (6.75) 

or 

U V =U i V i . (6.76) 

Equations (6.75) and (6.76) are deceptively simple and make the dot produce 
appear computationally cheaper than it really is. For example, in three dimensions, 
equation (6.75) reads 


U- V = UjV 1 + U 2 V 2 + U 3 v\ (6.77) 

and appears to be three multiplications and two additions — that is, as cheap as if the 
basis was Cartesian. Of course, that is not the case, because a contraction with the 
metric tensor is required in order to obtain the covariant coordinates Uj out of the 
contravariant coordinates U' . Therefore, as we would expect, index juggling does 
not lead to more efficient computational algorithms. It does, however, lead to more 
attractive and compact notation that hides the non-Cartesian nature of the local basis. 


6.13 Summary 

This chapter introduced the paramount tensor property. The tensor property leads to 
the key recipe for constructing invariant expressions: stick to tensor expressions and 
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contract away all indices. We proved that some the essential objects in Euclidean 
spaces are tensors. These include Z, , Z', Z, ; and Z y . On the other hand, partial 
derivatives of tensors result in variants that are not tensors. Therefore, while we 
were able to give an invariant formulation for the gradient in equation (6.49), we 
were unable to give do the same for the Laplacian since the collection of second 
derivatives d 1 F/dZ' 3Z 7 is not a tensor. This problem is fixed by introducing the 
covariant derivative — a new operator that produces tensors out of tensors. It is the 
subject of Chap. 8. 


Chapter 7 

Elements of Linear Algebra in Tensor Notation 


7.1 Preview 

Linear algebra and tensor algebra are inextricably linked. The mechanics of these 
subjects are similar: linear combinations dominate calculations and change of coor- 
dinates (referred to as change of basis in linear algebra) is of primary interest. Some 
ideas are best expressed with matrix notation while others are best expressed with 
tensors. Matrix notation is more effective at expressing the overall algebraic struc- 
ture of a calculation. Tensor notation is a better choice when it is necessary to refer 
to the individual entries of an indexed set of values. For example, the inverse rule 

{AB)~ X = (7.1) 

is best expressed with matrices. Meanwhile, the minimization of a quadratic form 

-x T Ax — x T b (7.2) 

2 

should be analyzed in the tensor notation. 

This chapter is devoted to exploring the correspondence between these notational 
systems as well as highlighting the several unique insights that the tensor notation 
brings to the fundamental topics in linear algebra. 


7.2 The Correspondence Between Contraction 
and Matrix Multiplication 

The values of a variant in a given coordinate system form an indexed set of 
numbers. How one chooses to organize these numbers into a table is a matter 
of convention. For first- and second-order systems, we adopt a convention that is 
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consistent with that of linear algebra. A variant of order one, T' or 7} , is represented 
by a column. The flavor of the index (covariant or contravariant) has no bearing on 
the representation. A variant of order two, T t j , T ‘ , or T ' J , is represented by a matrix 
in which the first index indicates the row and the second index indicates the column. 
Thus, it is important to clearly identify the order of the indices in a variant. 

For a variant of order higher than two, there is no single commonly accepted 
convention. For variants of order three, one may imagine a three-dimensional cube 
of numbers where the first index indicates the slice and the other two indices indicate 
the row and the column within the slice. Another approach, employed by several 
software packages, works for any order. The idea is to organize the values as tables 
within tables. For example, the system T l} i ( = i 2 jk of order 3 can be represented as 



(7.3) 


In this 2x2 matrix of columns, the first two indices determine the row and the 
column, and the final index determines the position within the innermost column. 
Similarly, the fourth-order system = i 2 jkl can be represented by the table 



(7.4) 


The first two indices determine the row and the column of the overall table, 
and the remaining two indices determine the row and the column within the 
innermost matrix. 

The matrix notation works best for two-dimensional systems. A matrix is usually 
denoted by a single letter. The product A B of two matrices A and B implies a 
specific contraction of the entries of A and B, as well as a specific way of arranging 
the result. Consider the product 


C = AB. 


(7.5) 


In indicial notation, this product reads 

Qj =Y J A ikB kj - (7.6) 

k 

In words, the second index of A is contracted with the first index of B . The result 
of this contraction for fixed i and j is placed in the i th row and j th column of the 
resulting matrix. 
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Second-order systems can be contracted in other ways, too: 


«r 

w* 

II 

O' 

(7.7) 

Cij — Akj Bjk 

k 

(7.8) 

Cji = Y, AjkBkj . 
k 

(7.9) 

In order to capture these operations, the matrix notation offers two devices: the 
order of multiplicative terms and the operation transpose. The above identities are 
expressed in the matrix notation as follows: 

C = AB t 

(7.10) 

C = BA 

(7.11) 

C T = AB. 

(7.12) 

Finally, the matrix notation requires the operation of the trace 
contractions within a single square matrix. The trace is defined 
diagonal entries: 

in order to capture 
as the sum of the 

Tr A = Y A kk- 

(7.13) 


k 


In contrast with the matrix notation, the tensor notation can be used for systems of 
arbitrary order and includes only two operations: the tensor product and contraction. 
The order of the terms is immaterial and the explicit operation of transposition is 
absent. 

The matrix notation is more effective in situations in which the overall algebraic 
structure is important and when it is best to view matrices as indivisible units. 

Consider, for example, the proof of the fact that for two invertible matrices A and 

B , the matrix products AB and BA have identical eigenvalues. Suppose that X and 
x are an eigenvalue and a corresponding eigenvector of AB: 

ABx = Xx. (7.14) 

Multiply both sides on the left by the matrix B 

BABx = XBx (7.15) 


and denote Bx by y: 


BAy = Xy. 


(7.16) 
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Therefore, A is an eigenvalue of BA and the nonzero vector Bx is a corresponding 
eigenvector. 

The matrix notation is perfect for this kind of reasoning and truly brings out the 
essence of the argument. While it is possible to present the same argument in tensor 
notation, the simplicity of the argument would be obscured by the manipulation of 
the indices. In short, the matrix notation is preferred in situations where the focus is 
on the algebra. 

Example 107. Express the symmetric property 

A t = A 

in the tensor notation. 

Example 108. Express the product C = A T B in the tensor notation. 

Example 109. Express the product C T = A 7 B i n the tensor notation. 

Example 110. Express the product C T = A T B l in the tensor notation and explain 
why it follows that (AB) r = B r A T . This is an example of a theorem in matrix 
algebra that is best proved in the tensor notation. 

Example 111. Use the matrix notation to prove that, for invertible matrices A, B, 
and C, 

(ABC)~ l = C~ l B~ l A ] . (7.17) 


7.3 The Fundamental Elements of Linear Algebra 
in Tensor Notation 

Suppose that a basis {ei, . . . , e„} is chosen in the n -dimensional linear space. Denote 
the elements of the basis collectively by e,- . A vector v has a unique decomposition 
v' with respect to the basis e,- : 


v = v'e,. (7.18) 

The values v' are called the components of the vector v with respect to the basis e, . 
We use a superscript in v' and employ the Einstein summation convention. 

How do the components v ! transform under a change of basis? In essence, we 
already answered this question in Chap. 6 where we concluded that the components 
of a vector with respect to a covariant basis form a contravariant tensor. Our current 
goal is to translate those ideas into the matrix notation. 

Consider a new basis e,/ and suppose that the two bases are related by the 
identities 
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e,v = J/,e, (7.19) 

e, = jfm'. (7.20) 

In the Jacobians and Jf' , assume that the superscript is the first and the subscript 

is the second. Furthermore, denote the Jacobian Jf, by the matrix X. Then, since 
Jf, Jf ' = 8‘j , the Jacobian Jf' corresponds to X -1 . Note that in equation (7.19), the 
contraction takes place on the first index of X . Therefore, in the matrix notation, the 
basis e,' is given in terms of e, not by the matrix X but by the transpose X T : 

E' = X t E, (7.21) 

where E and E' are the basis vectors formally organized into a column: 


ei " 


"e,/ _ 


; e' = 


_ _ 


_e„'_ 


(7.22) 


From Chap. 6, we know that the components v 1 ' and v' are related by the 
identities 


v*' = Jf'v* (7.23) 

v 1 ' = Jf,/. (7.24) 

Translating these relationships to the matrix notation, we find that the components 
v' (v' is v' organized into a column) are given in terms of v (v is v' organized into 
a column) by multiplication with X~ l . Conversely, the components v are given in 
terms of v' by multiplication with X : 

v' = X _1 v (7.25) 

v = Iv'. (7.26) 

This is a vivid illustration of the term coufravariant: the components transform in a 
way opposite of the bases. 

In summary, tensor calculus expressions provide excellent mnemonic rules for 
establishing correct matrix rules for transformations under a change of basis. The 
foregoing discussion was not a derivation of equations (7.25) and (7.26) — the 
derivation took place in Chap. 6 — but rather a translation of the easy to remember 
equations (7.23) and (7.24) into matrix form. 

Equations (7.25) and (7.26) can also be derived directly in the matrix notation. 
Rewrite equation (7.18) in the matrix notation: 

v = v^E. 


(7.27) 
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In the equation above, insert, between v 7 and E, the identity matrix in the form 
(X T )~ l X T : 


v = v T ( X T ) 1 Z r E (7.28) 

and group the elements on the right-hand side as follows 

v = (X~ l v) T (X r E) . (7.29) 

Since X 7 E is E' , the combination X~ l v must be V , which is equation (7.25). 

A linear space equipped with an inner product (u, v) is called a Euclidean space. 
This is different from the sense in which we use the term Euclidean space in the rest 
of this book. The inner product can be evaluated in terms of components 

(u,v) = v 7 e y ) = (e,,ey) u'v j . (7.30) 

The matrix M with entries 

M u = (e, , e 7 ) (7.31) 

is called the Gram matrix. We may refer to M,j as the Gram tensor, but of course it 
is nothing more than the metric tensor Z, ; . It can even be used to juggle indices, as 
we do later in this chapter. In terms of the Gram tensor, the inner product reads 

(u,v) = M.jii'v 1 . (7.32) 


In the matrix notation 


(u, v) = u T Mv. 

For an orthonormal basis e, , the Gram matrix is the identity. 

Once again, tensor calculus offers a simple way to remember the formula for the 
transformation of the Gram matrix under a change of basis. By analogy with the 
metric tensor, M, ; is a doubly covariant tensor and therefore transforms according 
to the rule 


Mi,y = Mjj ji'Jy. (7.33) 

To express this identity in the matrix notation, first rewrite it as follows: 

M vr = .//■.)/;, .//.. 


(7.34) 


7.4 Self-Adjoint Transformations and Symmetry 
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Note that the first index of J‘, is contracted with the first index of M iy - and the 
second index of M, , is contracted with the first index of J ',. Therefore, the matrix 

J J 

equivalent of equation (7.33) is 

M' = X T MX. (7.35) 

Finally, we discuss the matrix representation of linear transformations. Suppose 
that v is the image of u under the linear transformation A: 

v = A (u) . (7.36) 

In the matrix notation, this relationship is represented by the matrix product 

v = Au. (7.37) 

In the tensor notation, since both u' and v' have superscripts, the natural way to 
express equation (7.37) is to let the first in index in the representation of A be a 
superscript and the other a subscript: 

v 1 ' = A)u J , (7.38) 

This identity immediately suggests that A 1 - is a tensor with one contravariant 
and one covariant dimension. This is indeed the case. Therefore A’j transforms 
according to the formula 

A), - A' (7.39) 

Thus, in the matrix notation, the matrices A! and A are related by 

A ' = X~ l AX. (7.40) 

As a direct consequence of the corresponding theorem in linear algebra, we 
conclude from equation (7.40) that the eigenvalues of a tensor A'j are invariant 
under a change of coordinates. 


7.4 Self-Adjoint Transformations and Symmetry 

A linear transformation A is called self-adjoint if, for any two vectors u and v, 

(Au, v) = (u, Ay) . (7.41) 

Self-adjoint operators are of central importance in applied mathematics, physics, 
and engineering. 

Self-adjoint transformations are said to be represented by symmetric matrices. 
In actuality, this is only true when the transformation is expressed with respect 
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to an orthonormal basis. The orthonormality of the basis is often implied or 
omitted altogether. If the basis is not orthonormal, the statement is generally untrue. 
Fortunately, the tensor notation alerts us to the potential danger. After all, a tensor 
A'j that represents the linear transformation has one superscript and one subscript. 
That is not a kind of system to which the term symmetric can even be applied in the 
usual sense. To clarify, a system Tjj is called symmetric if 7)/ = Tjj. What would 
symmetry mean for a system such as A 1 - ? 

The tensor notation will help us discover the correct general characterization 
of matrices that represent self-adjoint transformations. Express the inner products 
(Au, v) and (u, .4 v) in the tensor notation 


(Au, v) = Mjj A‘ k u k v J (7.42) 

(u, Av) = Miju‘A J k v k . (7.43) 


Reorder the terms and rename the indices in the second identity as follows: 

(u ,A\) = M ki A i j u k v i . (7.44) 

Since these identities hold for any u k and v J , we have 

MijA' k = MkiA'j. (7.45) 

Rename the indices so that i and j are the two live indices, and take advantage of 
the symmetry of Mjj : 


M ki A) = M kj A k . (7.46) 

Finally, rewrite equation (7.46) in the matrix notation: 

MA = A t M. (7.47) 

Equivalently, 

A = M~ 1 A t M. (7.48) 

This is a general statement characterizing a matrix A representing a self-adjoint 
transformation. It generally implies that A is not symmetric, unless the basis is 
orthonormal ( M = /). 

The tensor notation offers an elegant way to capture equation (7.48). Use the 
Gram tensor to lower the superscript on both sides of the equation (7.46). 


(7.49) 


7.5 Quadratic Form Optimization 
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What a nice identity! It states that, with the index lowered, the tensor A ;/ - is 
symmetric. Of course, that is not equivalent to the matrix A being symmetric 
because the matrix A has the entries A '■ , not A (/ - . When we raise the index i on 
both sides of equation (7.49), we obtain another elegant identity equivalent to (7.49) 

A[ j =A i j . (7.50) 

The tensor on the left has the same entries as the matrix A. The tensor on the 
right does not. Equation (7.49) illustrates the importance of maintaining or carefully 
ordered list of indices. 


7.5 Quadratic Form Optimization 


Quadratic form optimization is a problem in which access to the individual entries of 
a matrix is necessary in order to apply calculus. A quadratic form is a multivariable 
generalization of a parabola with a linear shift: 


/ (x) = -ax 2 ~ bx. 


(7.51) 


When a > 0, / (x) has a minimum at 

b 

x = — 
a 

since 


f (x) = ax — b. 


(7.52) 


(7.53) 


In the case of N variables X| , . . . , x ; y organized into a column called x, the 
quadratic form is given by the matrix expression 

/ (x) = -x T Ax — x T b, (7.54) 

where A is a symmetric positive definite matrix and the entries of 


b = 



bn 


(7.55) 


are constants. The positive definiteness of A is analogous to a > 0 in 

equation (7.51). It is a sufficient and necessary condition in order for / (x) to 
have a well-defined minimum. The minimum of / (x) occurs at x given by 


Ax = b. 


(7.56) 
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which is seen to be entirely analogous to (7.52). Our goal is to derive equation (7.56) 
by the methods of calculus: that is, by equating the partial derivatives of / (x) to 
zero. 

Exercise 112. Linear algebra offers a derivation that does not require calculus. 
Introduce a new vector r = A~Kx — b and express / (x) in terms of r. Show 
that the minimum of the new quadratic form occurs at r = 0. 

Exercise 113. Without the tensor notation, the calculus approach proves surpris- 
ingly cumbersome. Derive the partial derivatives of 

1 , 1 , 

/ (*> y) = 2 AllX + AnX y + 2 Al2y - blX - biy ( ' 7 ' 57) 

and show that this calculation leads to equation (7.56). 

Exercise 114. Perform the same task in three dimensions for / (x, y, z). 

We now use the tensor notation to carry out the calculation outlined in the these 
exercises. Denote the independent variables by x‘ , the matrix A by A ( A ,-y = Aj,), 
and the vector b by bj . The quadratic form (7.54) is given by the expression 

/ (x) = -Aijx l x J —bjX 1 , (7.58) 

where we have customarily suppressed the index of the function argument. Let us 
calculate the partial derivative of / (x) with respect to x k . By the product rule, 


3/ 1 3x' ■ 1 ■ 3x 7 dx' 

dx 1 ~ 2 + 2 ' jX 3x^ ~ bl dx*' 

The partial derivative 3x ! / dx k is captured by the Kronecker symbol: 


(7.59) 


dx' 

dx k 


= 8i. 


(7.60) 


Thus, 


df_ 

dx k 


\Aijiixi + l -A ij x i 8i-b i 8 i k . 


which simplifies to 


(7.61) 


3f_ 

dx k 


1 

2 


Akj x~i -f- 


-A ik x l - b k . 


(7.62) 


7.6 The Eigenvalue Problem 
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Since Ajj is symmetric, the first two terms on the right-hand side are equal: 


¥_ 

dx k 


AkjX b k - 


Equating the partial derivatives 3/ /dx k to zero yields 

A ki x' = b k . 


which, of course, is equivalent to equation (7.56)! 
Exercise 115. Show that the Hessian 3 2 f /dx 1 <)x J equals 

H- = A . 
dx' dxJ ‘ J 


(7.63) 


(7.64) 


(7.65) 


7.6 The Eigenvalue Problem 

In this section, we show that, much like Ax = b , the eigenvalue problem 

Ax = XMx (7.66) 

can be formulated as a variational problem. The matrix M is assumed to be 
symmetric and positive define. The variational formulation is to find the extrema of 

/ (x) = Ajjx' x J (7.67) 

subject to the constraint that 

M u ; c'V = 1 . (7.68) 

Geometrically, equation (7.68) states that the vector x' unit length. Use a Lagrange 
multiplier A to incorporate the constraint in the augmented function E (x. A) 

E (x, A) = A ij x‘x j - A (Mijx'x-i - l) . (7.69) 


Following earlier analysis, 


1 3 E (x) 


= . 1 » / .v ? — A MjjX-i . 


2 dx 1 


(7.70) 
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Equating the partial derivatives to zero yields 


Ajjx J = \Mjjx J 


(7.71) 


which is equivalent to the eigenvalue problem (7.66). 
Exercise 116. Show that equation (7.71) can be written as 


Aijx 1 = Ax,- 


(7.72) 


and as 


A‘jX J = Xx J . 


(7.73) 


Exercise 117. Show that the eigenvalues of the generalized equation (7.71) are 
invariant with respect to change of basis. 

Exercise 118. Show that the eigenvalues of the generalized equation (7.71) are 
given by the Rayleigh quotient 


A = Ajjx'x 1 . 


(7.74) 


7.7 Summary 

In this chapter, we discussed the interplay between tensor calculus and linear 
algebra. We discovered that a number of concepts in linear algebra are clarified with 
the help of tensor notation. In particular, the matrix rules for the transformations of 
vector components, the Gram matrix, and the matrix representing a linear operators 
can be determined from the tensor properties of these objects. On the other hand, 
tensor calculus draws on several fundamental results from linear algebra. One of 
those results in the invariance of eigenvalues under similarity transformations. In 
Chap. 13, the invariance of eigenvalues plays an important role in the study of 
curvature. 


Chapter 8 

Covariant Differentiation 


8.1 Preview 


Chapter 6 demonstrated that the tensor property is the key to invariance. However, 
a partial derivative 3/3 Z' of a tensor is itself not a tensor. This is a major obstacle 
in the way of developing differential geometry using the coordinate approach. For 
example, the expression 3 T' / 3 Z' cannot be used as a definition of divergence since 
it evaluates to different values in different coordinates. Similarly, Z' 7 d 2 T /3Z' 3Z 7 
is not invariant and is therefore not a legitimate definition of the Laplacian. 

This difficulty was overcome by Gregorio Ricci-Curbastro and his student Tullio 
Levi-Civita (Fig. 8.1) in their celebrated work Methods of Absolute Differential 
Calculus and Their Applications [34] in the year 1900. Levi-Civita later wrote a 
classic textbook on the subject entitled The Absolute Differential Calculus [28]. 

The solution comes in the form of a new differential operator, V, , the covariant 
derivative. Similarly to the partial derivative, the covariant derivative satisfies the 
familiar sum and product rules. Furthermore, it has the desirable property that it 
produces tensors out of tensors. The resulting tensor is one covariant order greater 
than the original tensor. The contravariant derivative V' is defined by raising the 
index on the covariant derivative: 


V' = Z' 7 V; . 


(8.1) 


The new operator has a number of other desirable properties. One of those 
key properties is that covariant differentiation reduces to the partial differentiation 
in Cartesian (more generally, affine) coordinates. This enables us to define the 
divergence of a tensor field T' as 


v,r 


(8.2) 


and the Laplacian of the invariant field T as 


Z' 7 V, V ; T. 


(8.3) 
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1 ?•/£ . Xcsy _ C^M. 


Fig. 8.1 Gregorio Ricci-Curbastro (1853-1925), son of a famous engineer, is considered the 
inventor of tensor calculus. Tullio Levi-Civita (1873-1941) was Ricci-Curbastro’s student. 
A superb geometer, he was one of the key figures in mathematical foundations of general relativity. 
Many interesting insights into Levi-Civita’s life can be found in [33] 


On the one hand, these expressions are invariant. On the other hand, they coincide 
with the usual definitions in Cartesian coordinates. Therefore, these equations are 
valid in all coordinates. 

The Christoffel symbol defined in Chap. 5 plays a central role in covariant differ- 
entiation. The following motivating example illustrates the role of the Christoffel 
symbol. Following the motivating example, we present the formal definition of 
the covariant derivative and devote the rest of the chapter to establishing its key 
properties. 


8.2 A Motivating Example 


Consider an invariant vector field Y defined in the Euclidean space referred to 
coordinates Z' . Then the new object obtained as the partial derivative 


_9Y 

dzJ 


(8.4) 


is a tensor (with vector components), as was demonstrated in Chap. 6. Decompose 
V with respect to the covariant basis Z, : 


V = v‘ Zj . 


(8.5) 
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Substituting this expansion in equation (8.4) and applying the product rule, we 
obtain 


_3V 

dzJ 


dv '■ 

= az7 z '- + K 


, az, 

8zj ' 


(8.6) 


The partial derivative 3Z, / 3Z 7 of the covariant basis can, of course, be expressed in 
terms of the Christoffel symbol T,* . This is, in fact, the very definition of T^- which 
was first given in equation (5.56) and repeated here 


3Z i 

dzJ 


r*jZ k . 


(5.56) 


Thus 


3V 

dZJ 


8V‘ 

dZJ 


z, + 7'r* z,. 


(8.7) 


Our next goal is to factor out Z, , so that what remains in parentheses can be 
interpreted as the components of 3V/3 Z 7 with respect to the covariant basis. 
Rename the dummy index A: to i in the second term. In order to avoid using the 
index i twice in a single expression, also rename the already present dummy index 
i to m . Then factor out Z, : 


_3V 

dzJ 


( 8V [ \ 

(az7 + Vj z, 


( 8 . 8 ) 


We are nearly done. In a final step, take advantage of the symmetry of the Christoffel 
symbol with respect to the two lower indices to rewrite this expression as 


d\ 

3Z7 


dV‘ 

dzJ 


+ 


r 



z„ 


Now comes the key point: the expression in parentheses 


8V f 

3 zJ 


+ 


r' v m 

jm 


(8.9) 


(8.10) 


is a tensor. This is because it represents the components of a tensor with respect to 
the covariant basis Z, . (In Sect. 6.3.5, we showed that the component of a vector 
with respect to Z, is a contravariant tensor.). It is interesting that neither term in this 
expression is a tensor, yet they combine to produce one. 

The expression 3 V /dZ J + Y' jm V" is a better measure of the rate of change 
in the tensor V than 3 V /8Z J alone. The quantity 3 V' /8Z’ captures the rate 
of change in the component V . But it also takes the (changing) covariant basis 
Z, to rebuild V from V 1 . For example, the vector field Y may be constant but, in 
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curvilinear coordinates, the components V vary. Conversely, a constant tensor field 
V in curvilinear coordinates corresponds to a varying vector field Y. However, 
that partial derivative dV /<)Z J only measures the rate of change of the tensor field 
dV /d Z J and ignores the change in the basis Z, . 

The combined expression dV /3Z 7 + T' m V m captures the rates of change of 
both elements that contribute to Y: the component V and the covariant basis Z, . 
And, as equation (8.9) shows, the tensor combination dV' /3Z J + T' m V m represents 
precisely the components of the rate of change in the vector field V. We therefore 
propose the following definition of the covariant derivative V, of a contravariant 
tensor V : 


VjV 


dV 

dzJ 


+ 


r i v m 

jm 


(8.11) 


This definition needs to be modified for covariant tensors. If V is decomposed 
with respect to the contravariant basis Z' 


V = V, Z' 


( 8 . 12 ) 


then we find 


av 

dZJ 


' dVj 

dzJ 


— r m v 

1 ij v " 


(8.13) 


We similarly conclude that the combination dVj / dZ ’ — T'" V m is a tensor and that 
it perfectly represent the rate of change of the vector held V. Thus, we define the 
covariant derivative V 7 of a covariant tensor Vj according to 


VjVt = ^J~ r d Vm - ( 8 - 14 ) 

The dehnitions (8.11) and (8.14) are quite similar. The differences are the minus 
sign in the case of the covariant tensor and index on which the Christoffel symbol is 
contracted with original tensor. 

Exercise 119. Derive equation (8.13). 

Exercise 120. Explain why V ; V; is a tensor. 

Exercise 121. In Exercise 82, you were asked to show that the acceleration of a 
particle moving along the curve Z' = Z' (t ) is given by 

dV 1 ■ . 

A' = — + r l jk V J V k . (8.15) 


Conclude that A' is a tensor. 


8.3 The Laplacian 


109 


8.3 The Laplacian 

We now present the Laplacian operator — a central differential operator in applied 
mathematics. This discussion also illustrates the variant property of the covariant 
derivative. In Chap. 6, a variant was defined as an object that can be constructed 
by a similar rule in various coordinate systems. The covariant derivative satisfies 
this important requirement: it can be interpreted by a consistent algorithm across all 
coordinate systems. 

The Laplacian A F of an invariant field F is usually defined by its expression in 
Cartesian coordinates: 


A F 


3 2 F 3 2 F 
dx 2 3 y 2 


(8.16) 


This definition immediately raises a number of questions, the first one being whether 
this expression evaluates to the same value in different Cartesian coordinates. The 
answer is yes, but to show this outside of the tensor framework requires a lengthy 
calculation. Nevertheless, (8.16) is a legitimate definition and is our starting point. 

In tensor calculus, the Laplacian is defined by the expression 


A F = Z' 7 V,- V y - F. 


(8.17) 


So far, we have defined the covariant derivative only for tensors of order one. 
For invariants, such as F , the covariant derivative is defined simply as the partial 
derivative 


V,-F = — . (8.18) 

3Z' 

This property is discussed below in Sect. 8.6.2. 

Later in this chapter, when we have proved the metrinilic, product, and contrac- 
tion properties of the covariant derivative, the definition of the Laplacian is captured 
by an even more compact expression 

AF = V,'V'F. (8.19) 

The central point is the invariance of the expression in (8.17). It is an invariant 
because all elements of the expression are tensors. It is therefore guaranteed to 
produce the same value in all coordinates. This simple point is the central idea in 
tensor calculus and its importance cannot be overstated. 

The other important point is the algorithmic nature of the covariant derivative and 
the definition (8.17). The expression in (8.17) can be interpreted the same way in all 
coordinate systems and in each case produces an expression in terms of conventional 
partial derivatives that is valid in that particular coordinate system. 
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In Cartesian coordinates, the Christoffel symbol vanishes and the covariant 
derivatives reduce to partial derivatives: 


AF = Z'i - — — — r. (8.20) 

dZ'dZJ 

The contravariant metric tensor Z' J has two nonzero components Z 11 = Z 22 = 1. 
Therefore, among the four terms in (8.20), only two survive: 


In other words. 


3 2 F | 3 2 F 

8Z l dZ l + 3Z 2 3Z 2 ’ 


3 2 F 3 2 F 

F ~ Jx 2 + ~dy 2 ' 


(8.21) 


( 8 . 22 ) 


Thus, in Cartesian coordinates, the definition (8.17) agrees with the conventional 
definition (8.16). We can also conclude that the expression (8.22) evaluates to the 
same value in all Cartesian coordinate systems. 

We now calculate the expression for the Laplacian in polar coordinates. We 
mention at the outset that a more effective way to calculate the Laplacian is 
expressed by the Voss-Weyl formula discussed in Sect. 9.8. 

Recall the nonzero entries of the Christoffel symbols in polar coordinates 
calculated in Chap. 5: 


r l 22 = -r (5.77) 

r?2 = r 2! = (5.78) 


The function F is now assumed to be expressed as a function of the coordinates 
Z 1 = r and Z 2 = 9. Since the covariant derivative coincides with the partial 
derivative for invariants, the values of V ; F are 


VjF 

V 2 F 


3 F 
dr 
3 F 
30 ' 


By definition, the second-order covariant derivative V, V , F is given by 


V/ V F = 


dVjF 


3 2 F 


p AC y r 

3Z' i] k ~ dZ'dZJ 


r - rf. 


3F 


,J 3 Z k ' 


(8.23) 

(8.24) 


(8.25) 
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This indicial identity captures the following four relationships: 


V,ViF = 
V,V 2 E = 
V 2 ViF = 


Plugging in the values of the Christoffel symbol, we find 


3 2 F k 3 F 


Jr 2 ~ 11 3Z^ 

(8.26) 

3 2 F k 3 F 


drdd Tu dZ k 

(8.27) 

3 2 * * F k 3 F 


dOdr r 21 3Z A ' 

(8.28) 

d 2 F k 3 F 


Je 2 ~ 22 azz' 

(8.29) 


VjVjF = 


Vi V 2 E = 


VjVjF = 


V 2 V 2 F — 


3 2 F 
dr 2 

a 2 F 1 3 F 

(8.30) 


(8.31) 

drdd r 3 6 

3 2 F 1 3 F 



(8.32) 

dddr r 3 6 

3 2 F d F 

h r — . 

ae 2 dr 

(8.33) 


Notice that V i V 2 F and V 2 Vi F are equal. This is an example of the commutative 
property of the covariant derivative. Commutativity is discussed in Sect. 8.6.4. 

Finally, recall the nonzero entries Z 11 = 1 and Z 22 = of the covariant 
metric Z' 7 : 


Z' 7 V,-V ; F = Z 1 1 V, Vi Z 7 + Z 22 V 2 V 2 F = 


d 2 F i n 2 F a f\ 

dr 2 r 2 \ 3 6 2 3 r) 


(8.34) 


or 


Z iJ V/ S7j F 


d 2 F 1 3 F 
dr 2 + r dr + 


i a 2 F 

^Je 2 ' 


(8.35) 


An essential point of this discussion is that the expressions (8.16) and (8.35) for 
the Laplacian in Cartesian and polar coordinates are different in terms of partial 
derivatives but can both be obtained from the tensor definition (8.17). 

Exercise 122. Show that in cylindrical coordinates in three dimensions, the Lapla- 
cian is given by 


3 2 F 13 F 1 3 2 F 3 2 F 

Z' ; V- V ■ F = l l l 

1 1 dr 2 r dr r 2 3 9 2 dz 2 


(8.36) 
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Show that this expression is equivalent to 

^ 1 3 / 3F\ 1 8 2 F d 2 F 

‘ j ~ 7a7 v 97 ) + V 1 Je 2 + W 


(8.37) 


Exercise 123. Show that in spherical coordinates in three dimensions, the Lapla- 
cian is given by 

„ d 2 F 2dF 13 2 F cot 6 dF 1 3 2 F 

Z V,- VjF — + - — — I- - 2^2 ^ + 2 ■ a Tja' (8.38) 

dr z r or r z ou z r z 06 r z sint7 d(p z 

Show that this expression is equivalent to 




/ 2 3F\ 1 3 j , 

V ik ) + Tarsus { sme 


1 


3 2 F 


dF\ 

dO ) r 2 sin 2 6 3 (p 2 


(8.39) 


Exercise 124. Derive the expression for the Laplacian in two dimensions in affine 
coordinates X and Y related to the Cartesian coordinates x and y by stretching: 


X = Ax 
Y = By. 


(8.40) 

(8.41) 


8.4 The Formula for V,- Z j 

The discussion of the Laplacian served as a good illustration of the mechanics of the 
covariant derivative. We would now like to give another such illustration, in which 
the covariant derivative is applied to the covariant basis Z, in polar coordinates. This 
calculation illustrates the metrinilic property of the covariant derivative discussed 
in detail below in Sect. 8.6.7. As we have been consistently emphasizing, tensor 
operations work equally well for tensors with scalar and vector elements. The 
covariant derivative is yet another example of this. 

Let us begin by calculating the vector V 1 Z 1 . By the definition of the covariant 
derivative, we have 

ViZ t = -i-rfjZi. (8.42) 

or 

Here, we would like to point out an important characteristic of the covariant 
derivative: each element of V, Z y - involves all elements of Z, . This is unlike the 
partial derivative, where dZ\/dr can be computed by itself, without referring to Z 2 . 

As we established in Chap. 5, Zj is a unit vector that points directly away from 
the origin. Therefore, for a given 0, does not change with r and we have 
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9Zi 

= 0. (8.43) 

or 

Furthermore, since both entries rfj of the Christoffel symbol vanish, we have 

ViZj = 0. (8.44) 

Next, let us calculate V 1 Z 2 . First, we write out the definition: 

ViZ 2 = -l-ri 2 Z k . (8.45) 

or 

The vectors Z 2 are orthogonal Zi and their length equals r. Therefore, for a given 
value of the coordinate 6 , the vectors Z 2 all point in the same direction and their 
length grows linearly with r. Thus 9Z 2 9r is the unit vector pointing in the same 
direction. Since r' 2 = 0 and r 12 = r 1 [see equation (5.78)] we find that the 
combination T k 2 Z k is the unit vector that points in the direction opposite of Z 2 and 
therefore 


VjZ 2 = 0. (8.46) 

We now turn to V 2 Z] which is given by 

V 2 Z! = - V k 2l Z k . (8.47) 

The rest of the calculation for V 2 Z! is contained in the following exercises. 

Exercise 125. Using geometric arguments, explain why 8Z\/86 is the unit vector 
that is orthogonal to Z\. 

Exercise 126. Conclude that 


V 2 Zj = 0. (8.48) 

Exercise 127. Give an alternative derivation of this identity by showing the general 
property that V,-Z j is a symmetric object 

V/Z; = V ; Z ; . (8.49) 

Hint: dZj/dZ' = 9 2 R/9Z'9Z^' and r*. = Tjj.. 

Exercise 128. Show that 


V 2 Z 2 = 0 . 


(8.50) 
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We have just shown that all elements of V, Z 7 in polar coordinates are zero: 

V,-Z; = 0. (8.51) 

Importantly, this is not a special property of polar coordinates, but true in all 
coordinates. This is an important feature of the covariant derivative. Note that one 
of the main reasons that Cartesian coordinates are so convenient in many problems 
is that the coordinate basis i, j is unchanged from one point to another. Operating in 
curvilinear coordinates sacrifices this desirable property. However, at least from the 
algebraic point of view, the covariant derivative restores it! 

This important property of the covariant derivative can be demonstrated in several 
ways. Once we prove that tensor property of the covariant derivative, we will be 
able to conclude that V, Z j vanishes in all coordinate systems because it vanishes in 
a particular coordinate system. (Note, that we could have shown even more easily 
that V, Zj = 0 in Cartesian coordinates.) This is a common way of showing that a 
tensor vanishes. It was justified in Exercise 94 in Chap. 6. 

This property can also be easily demonstrated by a direct algebraic calculation. 
By definition, 

9Z 

V,-Z 7 - = dz J - rfj Ik - (8.52) 

However, by the very definition of the Christoffel symbol given in equation (5.56), 

3Z; , 

az7 " r ^- (8 - 53) 

Therefore 

V,-Z; = 0. (8.54) 

Exercise 129. Show similarly that 

V t Z J = 0. (8.55) 

We now turn to the task of defining the covariant derivative for variants of order 
greater than one. 


8.5 The Covariant Derivative for General Tensors 

The definitions (8.11) and (8.14) can be extended to variants with arbitrary 
collections of indices. The general definition includes a Christoffel term for each 
index. We give the complete definition for a variant Tj with one covariant and one 
contravariant index. It is clear how this definition extends to variants with other 
combinations of indices. For Tj , the definition reads 


8.6 Properties of the Covariant Derivative 
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(8.56) 


As you can see, for each index there is a term in which 77 is multiplied by the 
Christoffel symbol and the index is contracted with the appropriate index from the 
Christoffel symbol. 

Let us apply the recipe encoded in equation (8.56) to the triply covariant tensor 
Tijk. In additional to the partial derivative dT rst /dZ k , we have a Christoffel term 
with a minus sign for each covariant index: 



(8.57) 


Exercise 130. Determine the expression for the covariant derivative V/ Tj, J . 
Exercise 131. Show that 


= o. 


(8.58) 


This is a special case of the more general metrinilic property of the covariant 
derivative discussed below in Sect. 8.6.7. 

In the next section we discuss the key properties of the covariant derivative. 
These properties make the covariant derivative as simple to use as the partial 
derivative. However, its tensor property make it much more powerful than the partial 
derivative. When an analysis is carried out within the tensor framework that employs 
covariant differentiation, that analysis is simultaneously valid in all coordinate 
systems. Therefore, the tensor approach offers the best of both worlds: it utilizes 
the advantages of coordinate systems while producing results that are independent 
of coordinate systems. 


8.6 Properties of the Covariant Derivative 
8.6.1 The Tensor Property 

The key property of the covariant derivative and, in fact, the very motivation for its 
invention, is that it produces tensors out of tensors. The implications of this property 
are crucial and far reaching. For example, the fact that 


(8.59) 


is tensor implies that 
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Z l] V; Vj = V 7 Vj 


(8.60) 


is an invariant and is therefore a suitable definition for the divergence of a tensor 
field Vj . Similarly, the fact that V, Vy F leads to the definition of the Laplacian as 
we saw in Sect. 8.3. 

In this section, we would like to present an elegant argument that proves the 
tensor property of the covariant derivative. We have already demonstrated the tensor 
property for the contravariant component V of an invariant vector field Y. However, 
any tensor or order one can be treated as the component of an invariant field and that 
is the basis of the proof. 

Suppose that V is a contravariant tensor. Then V = V Z, is an invariant and its 
partial derivative 



is a tensor. We repeat equation (8.13) 



(8.13) 


and again use the argument that the combination dV /dZ J + V' jm V" must be a 
tensor since it represents the component of a tensor field (3V / dZ J ) with respect to 
the covariant basis Z Therefore, Vy V = dV /dZ J + T l jm V m is a tensor. 

Exercise 132. Prove that Vy Vj is a tensor by forming the invariant V, Z' and 
analyzing its partial derivative with respect to Z 7 . 

1 believe that this elegant proof of the tensor property leaves little doubt that, for a 
tensor T' with vector elements, the covariant derivative produces a tensor. However, 
the argument doesn’t quite work since the contraction T'Z, is meaningless. Instead, 
one needs to consider the scalar invariant T formed by the dot product T = T' • Z, 
and this argument is left as an exercise. 

Exercise 133. For a contravariant vector T' , prove that V, T' is a tensor by forming 
the invariant T = T' • Z, . 

Exercise 134. For a covariant vector T, , prove that Vy T, is a tensor by forming the 
invariant T = T, • Z' . 

So far, we have demonstrated the tensor property of the covariant derivative for 
tensors of order one with scalar and vector elements. This opens up an elegant way 
for proving the tensor property of the covariant derivative for tensors of order two. 
For example, suppose that 7)y is a covariant tensor of order two. Then Ty = 7y Z' 
is a covariant tensor of order one for which the tensor property of the covariant 
derivative has already been established. Therefore, 


VfcTy 


(8.62) 
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is a tensor. By the product rule, which is proved below in Sect. 8.6.6, 

Va-T j = Vi (TijZ*) = V t 7y Z' + 7y V*Z f . (8.63) 

Since V^Z' is zero, we have 

ViT, = V, T,j Z 

and if follows that V/. 7y is a tensor since it is the covariant component of a tensor 
field VfcTy . 

Exercise 135. Show similarly that V/. Tj and V/- T' J are tensors. 

Exercise 136. Show, by forming dot products with the covariant and contravariant 
bases, that V^T jj, , and V/.-T'- 7 are tensors. 

Exercise 137. Argue the general tensor property by induction. That is, if T‘ J f' is 
a tensor, then V p T'f" is also a tensor. 

This concludes our discussion of the paramount tensor property of the covariant 
derivative. The proof present in this section is quite elegant and robust. However, 
it is also of great value to be able to demonstrate the tensor property more directly 
by showing that the covariant derivative of a tensor transforms under a change of 
variables in an appropriate way. This is accomplished below in Sect. 8.7 


8.6.2 The Covariant Derivative Applied to Invariants 

The covariant derivative V/ f coincides with partial derivative d/dZ k for invariants, 
or any variants of order zero. Rather than labeling this fact as a property of the 
covariant derivative, it is better viewed as a reasonable extension of definition (8.56) 
to variants of order zero. Indeed, definition (8.56) instructs us to include a Christoffel 
term of each index. Since variants of order zero do not have any indices, the right- 
hand side reduces to a single term with the partial derivative. This property is used 
quite frequently in tensor calculations when there is a need to relate partial and 
covariant derivatives. 


8.6.3 The Covariant Derivative in Affine Coordinates 

The covariant derivative V* coincides with partial derivative d/dZ k in affine and, 
in particular, Cartesian coordinates. This is true because the Christoffel symbol 
vanishes in affine coordinates. This is an important property and we have already 
used it to a great effect in a proof of the metrinilic property. It is frequently 
used in this way: a certain characteristic is demonstrated in affine coordinates and 
subsequently extended to general coordinates by the tensor property. This way of 
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reasoning was illustrated in Exercise 94 in Chap. 6. Another great example of a proof 
like this comes in the very next section where we show that covariant derivatives 
commute. It is a fundamental property of partial derivatives, but it proves much 
more subtle and geometrically insightful for covariant derivatives. 


8.6.4 Commutativity 

Covariant derivatives commute, that is 


VtV/7! = V/V^r! (8.64) 

and this property holds for tensors with any collection of indices. The difference 

ViV/Tj - V/Vi-rj. (8.65) 

is often written as (V*-V/ — V/V*) T‘ in order to emphasize the switching of the 
operators. The operator Vj. V/ — V/ V/ ; is commonly referred to as the commutator. 

The cornerstone of the upcoming argument is that the difference (8.65) is a 
tensor. Furthermore, in Cartesian coordinates, it reduces to the difference of partial 
derivatives 


d 2 T ) d 2 T I 

dz k dz' - dz‘dz k (8 ' 66) 

and therefore vanishes due to the commutative property of the partial derivative. 
Consequently, being a tensor, it must vanish in all coordinate systems. 

Given how easy it is to justify, commutativity may seem to be a relatively 
unremarkable property. However, this couldn’t be further from the truth. For 
example, in Sect. 8.129, we use this fact to show that the object R k mi j = 
9 r k m / 3Z — dT k m /dZ' + r k jn r n lm — Y k n r'j m is a tensor and must therefore vanish 
in all coordinates system. This is quite a remarkable relationship which can be used 
for many purposes. For example, given a variant G‘- k of order three, it can be used 
to determine whether G'j k can serve as the Christoffel symbol for some coordinate 
system. 

Furthermore, the commutative property of the covariant derivative and the cor- 
responding property that R k mi j = 0 no longer holds in non-Euclidean spaces. Non- 
Euclidean spaces arise in many fundamental applications that include differential 
geometry on curved surfaces and Einstein’s general relativity theory. We study the 
differential geometry on curved surfaces starting with Chap. 10. 
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8.6.5 The Sum Rule 

The covariant derivative satisfies the sum rule 

V* (T) +Sfj = V A - Tj + Va S) . (8.67) 

This rule holds for tensors with arbitrary collections of indices and its proof is left 
as an easy exercise. The sum rule is easily generalized to linear combinations 

Va (a Tj + bS) ) = a W k Tj + b W k S) , (8.68) 

where a and b are constants. There’s very little more that can be said about the sum 
rule and we are therefore moving on to the product rule. 

8.6.6 The Product Rule 

The covariant derivative satisfies the product rule also known as Leibniz’s law. The 
product rule reads 


Va ( V Uj ) = Va V Uj + V V k U } (8.69) 


and holds for tensors with arbitrary collections of indices. To demonstrate the 
product rule for S' Tj, apply the definition of the covariant derivative 

, . . 3 IPUj) 

v * (r = + r LT m Uj - r %T’ U m . (8.70) 

Since the partial derivative 3/3 Z k satisfies the usual product rule, we have 


y A (T'Uj) = ^u J + r ^ + T[ m T m Uj - r %ru m 

Group the first term with the third and the second with the fourth 

(d T \ . / dUi 

I — ^ 4- n r m 1 U: + t' I _ r™ u„ 


Vi ' {TlUj) = + r km Tm ) Uj + r ( 


(8.71) 


(8.72) 


and note that the parenthesized expressions are V k T' and Va Uj . This completes the 
proof. 
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Note that the product rule also applies in the case of the dot product: 


V* (T • U,-) = V k T ■ Uj + V ■ WkUj. 


(8.73) 


The proof of this property is left as an exercise. 


8.6.7 The Metrinilic Property 

The metrics Z, , Z', Zy, Z ,J and 5'- vanish under the co variant derivative. This is 
the metrinilic property. For the bases Z ,• and Z' , we have already encountered this 
property in Sect. 8.4, where it was demonstrated in two different ways. Furthermore, 
this property was shown for the Kronecker symbol Si in Exercise 131. 

For the metric tensors Zy and Z 1] it can be shown by a simple application of the 
product rule. For example, for the covariant metric tensor Zy , we have 


V kZjj — (Z, • Z j) — V^Z/ ■ Z j + Z, • V^Z j — 0. 


Exercise 138. Show the metrinilic property for the contravariant metric tensor Z y 
by the product rule. 

Exercise 139. Show the metrinilic property for the Kronecker symbol S'j by the 
product rule. 


Exercise 140. Show the metrinilic property for the delta symbols S l J s and 8'/^ by 
expressing them in terms of the Kronecker delta symbol Si . 


Exercise 141. Show the metrinilic property for the delta symbol S'/ s by a direct 
application of the definition 


Exercise 142. Use the property that V^Zy = 0 to show that 



(8.74) 


Exercise 143. Use the property that Z' J = 0 to derive the expression of 
3Zy/9Z*. 
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Let us summarize the metrinilic property: 


V;Zi, Vy Z' = 0 

(8.75) 

Va Zjj. V;Z- = 0 

(8.76) 

v,s;. , v p s;i, v p s% = o. 

(8.77) 


In Chap. 9, we encounter the Levi-Civita symbols e^ k and Sj jk that also belong 
to the group of metrics. The metrinilic property extends to the Levi-Civita symbols 
as well: 


V p e ijk , V p e ijk = 0. 

The metrinilic property has far-reaching implications. The first impact is on the 
essential operation of index juggling. We have grown accustomed to raising and 
lowering free indices on both sides of an identity. For example, we know that 

T t =S; j U kl V k , (8.78) 

is equivalent to 

S i = T ii U kl V kl . (8.79) 

Let us remind ourselves of why this true: the latter identity can be obtained from 
the former by contracting both sides with the contravariant metric Z' r (and then 
renaming r — > i). 

What happens, however, when the index that we intend to juggle occurs under 
the covariant derivative? For example, consider the identity 

Si = Vfc T/ U kl V,. (8.80) 

Contracting both sides with Z' r yields 

S r = Z ir V k T i j Uj l Vi. (8.81) 

Can Z ,r be passed under the covariant derivative so that 

S r = S7 k (z lr T/ ) U kl V, ? (8.82) 

Thanks to the metrinilic property the answer is yes. By the product rule, we have 

Z ; ' V;. r : j = Va- (z''T/J - Va Z ir Tj (8.83) 

and the second term vanishes by the metrinilic property, yielding 

Z ir SJ k T i 


Va (z iT 7/) . 


(8.84) 
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This shows that metric tensors pass freely across the covariant derivative. Thus, 

S r = V k T ri Uf'Vi , (8.85) 

or, by renaming r — > i , 

S' = S7 k T ij ufVi ? (8.86) 

We have therefore shown that free indices can be juggled across the covariant 
derivative. 

We have also grown accustomed to the fact that contracted indices can exchange 
flavors. For example, 


S)T? = S i} T ik . (8.87) 

Is this operation still valid if the exchange takes place across the covariant derivative, 
as in 


S’j V, 7f = Sij V/ T ik ? (8.88) 

Once again, thanks to the metrinilic property, the answer is yes and the proof is left 
as an exercise. 

Exercise 144. Show that contracted indices can exchange flavors across the covari- 
ant derivative. 

In addition to being able to freely juggle indices on the arguments of the covariant 
derivative, we are able to freely juggle indices on the covariant derivative itself. For 
example, the expression 


r'V,S/ (8.89) 

is clearly equivalent to 

Tj V' S J k . (8.90) 

Furthermore, 

TjVyV'Sf = rVjViSf. (8.91) 

Finally, the Laplacian A F = Z' J V, V y F can be written compactly as 

Af = V, V' F. (8.92) 

Written in this way, it is quite reminiscent of its Cartesian origins as the sum of 
second derivatives. We do not need to state again, however, just how powerful 
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equation (8.92) is by virtue of being valid in any coordinate system. Since the 
symbol V, V' is so compact, from this point on we will is it instead of the more 
conventional symbol A. 

Another important consequence of the metrinilic property is that we are able 
to study vector fields by analyzing their components. Suppose that V is the 
contravariant component of the vector field V: 

V = V'Zj. (8.93) 


Then 


3v 

az^ 


VjfcV = v k v z,. 


(8.94) 


Therefore, we see that the tensor V k V carries all of the information regarding the 
rate of change of the vector field V. By contrast, the partial derivative 3 V' /dZ k fails 
to capture the change in the accompanying covariant basis. 


8.6.8 Commutativity with Contraction 

Covariant differentiation commutes with contraction. Consider the expression 

V*7T. (8.95) 

It can be interpreted in two ways. First, it can be seen as the covariant derivative 
applied to the tensor Sj = T 7 . The tensor Sj is of order one. Its covariant derivative 
Vi Sj is given by 


V* Sj = 


dSj_ 
3 Z k 


r m <: 

1 jk^rr, 


Therefore, in this interpretation, V/, T‘- reads 


377 

V7 r pi -pm ’T'l 

V kl ij ~~ 


(8.96) 


(8.97) 


Alternatively, V k T‘- can be interpreted as the covariant derivative applied to 
the tensor 7T of order three to produce the tensor V/. T' r - of order four which is 
subsequently contracted on i and r. Following this interpretation, the tensor V/ ; T' rj 
is given by 
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Now, let us contract both sides on i and r: 



977, 


(8.99) 


The terms are T l m - correspond to the two indices in Tjh that we 

contracted. These terms are equal as can be seen by exchanging the names of the 
indices i and m in one of them. Therefore, these terms cancel and we find that the 
second interpretation of the expression V/ ; 7V- yields 


AT' 



( 8 . 100 ) 


which is equivalent to the contraction-first interpretation. 

Thanks to the contraction property there is, in fact, no ambiguity in the expression 
V*; T-j . Had this not been the case, the Einstein notation would have been viable 
and there would have been a need to reintroduce the summation sign to distinguish 
between 



( 8 . 101 ) 


This contraction property completes the list of the key properties of the covariant 
derivative. As we see, despite its apparent complexity, the covariant derivative is 
simply a better alternative to the partial derivative. 

The next section is devoted to a very important exercise: proving that the tensor 
property by deriving the rule by which the covariant derivative transforms under a 
change of variables. 


8.7 A Proof of the Tensor Property 

The marquee property of the covariant derivative is that it produces tensors out of 
tensors. We have established this property in Sect. 8.6.1 by an elegant argument. 
In this section, we will give a more direct proof based on establishing the rule by 
which the covariant derivative transforms under a change of variables. We show this 
property for three types of tensors: a covariant tensor 7} , a contravariant tensor T l , 
and a tensor of order two T‘ . We do not give a proof for a tensor with an arbitrary 
collection of indices, but it will be apparent to the reader that the proof can be 
extended to those tensors, as well. 
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8.7.1 A Direct Proof of the Tensor Property for Vy J,- 

Consider the covariant derivative Vy T, for a covariant tensor 7} . In the primed 
coordinates Z 1 ' , Vy/7}/ is given by 

3 T 

^ j,Ti ' = W 7 ~ T ‘'i’ Tk '- (8 ' 102) 

We have previously established how each of the terms in this expression transforms 
under the change of coordinates. We repeat those relationships here 


3 T, 
dZi' 

f-7 

+ 

^ N 

\ CO 

II 

(8.103) 

T k ' 
t Vj , 

= + 4 y jf 

(8.104) 

Tk' 

= T,4 

(8.105) 


The last two relationships help establish the transformation rule for the contraction 


T i'j' T k' 


if , ,tv - (r 


+ 4j,J«) T, J[, = T* J:'jfT k + 4j,T k , (8. 106) 


where the justification of the second equality is left as an exercise. Thus, combining 
both terms in equation (8.102), we find 


V/'7> 


dTi ji jJ . ji J. 
d Z j + i' 1 


r j 1 j J 

l ,j j , J j 


T k 


i k 

J;r j 


T k . 


The terms with the second-order lacobians cancel and we find 



which is precisely 

which shows that Vy 7) is a tensor. 


(8.107) 


(8.108) 


(8.109) 


8.7.2 A Direct Proof of the Tensor Property for Vy T 1 

The proof for the contravariant tensor T‘ will require an additional step. In 
coordinates Z' , Vy'7’ 1 is 
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dr 


V rT’ — 

] ~ azJ' 


7 J + Fyk'T ■ (8-HO) 

Let us catalog how each of the terms transforms under the change of coordinates: 

dr' dT 


_ T J J l -U r T< J 1 

dzJ' dzJ j' '' + v y 
r', = r Ljj'jUt, + 


j'k' 

r rk r 


jk J i 

t'j?. 


( 8 . 111 ) 

( 8 . 112 ) 

(8.113) 


From the last two relationships we find 


i'k' 


"=(1 


+ J 


%' J f) T l Ji = iV/7/,r* + 


(8.114) 


Combining all terms we have 
dT 


VyT 1 


dzj Jyjf + Tij u j j' + r %JfJj' Tk + J k' J !' J i' Tl - , 8 -ii.r» 


The first and third terms combine to yields V y - T‘ J 1 ., J‘ . However, this time, the 
fact that the terms containing the second-order Jacobians cancel is not immediately 
clear. Factoring out T l (this requires some index renaming), we find 


v,r = v,-r/ 


' i 1 ’ + ( r' 


jj j k 

-t j j, k , j j jj 


j r. 


(8.116) 


To show that the expression in parentheses vanishes, start with the identity 


j' j 1 

J i J i' 




(8.117) 


and differentiate with respect to Z' : 

y' jj 4. i 1 ' jj i k ' — o 

Thus, the nontensor term indeed drops out and we have 


(8.118) 




V - T' Z' Z J 
1 ' j ' 


(8.119) 


proving that Vy T' is indeed a tensor. 
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8.7.3 A Direct Proof of the Tensor Property for V* Tj 


We now turn to the proof of the tensor property of the covariant derivative when 
applied to a tensor Tj . In the coordinate system Z' , V^/Tj, reads 




dZ k ' 


+ r 


k'm 


'-pm 

'V 


pffl rpl 

1 j'k' 1 m' ’ 


( 8 . 120 ) 


Let us determine how each term transforms under the change of coordinates. For 
dTj',/dZ k ' , we have 


T‘ — T‘ V T 1 

1 r - 1 j j ‘ 

By a combination of the product rule and the sum rule, we find 


3 T i 3 T 

IL _ J_ Ti' jj jk I rpi ji ' jj jk , pi jj' jj 

dz k ' - dZ kJ ’ j' k ' + 1 j JikJ j' J k' + l i J i J i’k ’■ 




For the second term T‘ k , m ,Tp , we have 


’ i ' 7 -•m' ( T-'i ji’ jk jm , ji j i'\ pr jm' jj 

k'm 11 j' — \ l km J i J k' J m' ^ J k'm' J i ) I j J > J j' 


Multiplying out, 


j ' ; ' pm' T ■ i pm ji' jk ts , /■/ ji ji' jm' jj 

1 k’m’ 1 i’ 1 km 1 i J i J k' J j> 1 j J k'm' J i J r J 


Finally, for the last term Tf, k , Tj n ,, we have 


pm' pi' ( 

1 j'k' * m' ~ ^ 


pm jm' jj jk , jm rm'\ pi ji' js 
1 jk J m J j’ J k'-rJj'k' J m ) 1 s J i' J m'- 


(8.121) 


( 8 . 122 ) 


(8.123) 


(8.124) 


(8.125) 


Expand the expression on the right hand side: 


t-'h/ pi' pi jj jk ji' I pi ri' jm 

1 j'k^m' — 1 jk 1 m J j' J k' J r ^ 1 m J i J j'k >• 


(8.126) 


Combining the terms in equations (8.122), (8.124) and (8.126), we recognize 
the terms from the proofs of the tensor property of V, 7) and V, T J . Namely, 
the leading terms from each equation combine to give precisely V/. Tj Jj' JjrJf. 
Among the remaining terms, the last term in equation (8.122) cancel the second 
term in equation (8.126). This is analogous to the cancellation that took place in 
the proof of the tensor property of V, Tj . The other two terms, Tj Jj k jj, J k , from 

equation (8.122) and Tj Jj/ m , Jj Jf' Jj, from equation (8.124) combine to give 
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Tj (44 + Jf) 4’ ( 8 - 127 ^ 

where, as before, the last step required renaming some of the dummy indices and, 
as before, we conclude that this term vanishes. We have therefore shown that 

Vk'T)', = V A -r; jfjj, J k , (8.128) 


which proves the tensor property of V A Tj . 


8.8 The Riemann-Christoffel Tensor: A Preview 


The Riemann-Christoffel tensor is a crucially important object in tensor calculus 
and has an interesting history. The concepts that are captured so beautifully by 
the Riemann-Christoffel tensor were conceived by the German mathematician 
Bernhard Riemann (Fig. 8.2) and first described in his masterpiece [35], Of course, 
[35] predates tensor calculus by 35 years and does not use the tensor notation 
championed in this book. In fact, Riemann felt that his ideas were best expressed in 
words rather than in equations. As a result, [35] contains very few equations which 
makes it a challenging read to say the least. If you can read German [36] or Russian 
[37], you can find a detailed commentary — in tensor notation — by Hermann Weyl 
in the 1919 edition of Riemann’s collected works. 

The Riemann-Christoffel tensor arises in the analysis of the commutator V, V ; — 
V y V, . It is straightforward to show that the identity 


ViVjT k -VjViT k 


'dr k 

jm 

3Z' 


dr ? 1 

A i YU 1 p& p/2 

' 1 in 1 jm 


dZj 


p/C p/2 

1 jn 1 im 


(8.129) 


Fig. 8.2 Bernhard Riemann 
(1826-1866) was a pivotal 
figure in the development of 
modern geometry 
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holds for any tensor T k . Thus the expression in parentheses is a tensor. (Demonstrate 
this.) This tensor is known as the Riemann-Christoffel tensor R k mi j 


Dt _ 
■mij 


art 


J m , pA: p/2 pA: p/i 

i 37 ; ' 1 in L jm 1 jn 1 inf 


3 Z 


~<k 
im 

dZJ 


(8.130) 


This object allows us to write the commutator V, V / T k — V ; - V, T k very compactly: 


V- V/ T k - V , V, T k = R 


k rjitn 

■mij 1 * 


(8.131) 


Since the covariant derivatives commute in a Euclidean space, the Riemann- 
Christoffel symbol vanishes 


R k mij = 0. (8.132) 

This equation is a powerful statement about the relationship among the entries of 
the Christoffel symbol. 

The Riemann-Christoffel tensor takes center stage in Part II, which is devoted 
to the tensor description of curved surfaces. In Chap. 12, the profound significance 
of the Riemann-Christoffel tensor becomes apparent. Furthermore, the Riemann- 
Christoffel tensor plays a central role in Einstein’s general relativity in which the 
ambient space is assumed to be non-Euclidean. 

Exercise 145. Derive equation (8.129). 

Exercise 146. Show that the Riemann-Christoffel tensor vanishes in a Euclidean 
space by substituting equation (5.60) into equation (8.130). After an initial applica- 
tion of the product rule, you will need equation (5.65) to re-express the derivative of 
the contravariant basis. 


8.9 A Particle Moving Along a Trajectory 

This section presents a problem that is best solved by a developing its own mini- 
calculus. A situation such as this arises quite frequently in practice. The problem 
discussed here is usually presented in the context of parallelism along a curve in a 
Riemann space. 

Consider a particle moving in a Euclidean space, referred to a general coordinate 
system Z' , along the trajectory T (t ) given by 

Z' = Z'(t). (8.133) 

Our goal is to calculate the components of the particle’s velocity, acceleration, and 
jolt (rate of change in acceleration, also known as jerk and surge). 
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In order to determine the components of the velocity vector, note that the position 
vector R of the particle is given by the composition 

R(0 = R(Z(0) (8.134) 

of the functions in equations (5.1) and (8. 133). Differentiating equation (8.134) with 
respect to t , yields 


or 


Thus 


V (t) = 


9R dZ‘ 

az 7 ”*”’ 


v(0 = 


dZ l 



V (t) = 


dZ' (t) 
dt 


(8.135) 


(8.136) 


(8.137) 


We note that V (t), as well as derivatives R*"* (t) of all orders of R (?) with 
respect to t, are invariants with respect to the change of coordinates Z', since the 
differentiation of R ( t ) with respect to t does not even involve the coordinates Z' . 

Exercise 147. Argue, on the basis of V being a tensor, that V is tensor with respect 
to the change of coordinates Z' . 

Exercise 148. Show, by relating V 1 ' to V , that V 1 is a tensor with respect to the 
change of coordinates Z 1 . 

Exercise 149. Show that dV /dt is not a tensor with respect to the change of 
coordinates Z' . 

To obtain the components A‘ (t) of acceleration A (t), differentiate equa- 
tion (8.136) once again with respect to t . 

Exercise 150. Show that 


A (0 = (^ + r ) k V J V k \ Z, , (8.138) 

thus 

dV' • , 

A 1 (0 = — + r‘ jk V J V k . (8.139) 

Exercise 151. Argue that the combination on the right-hand side of this equation is 
a tensor. (Hint: A is an invariant). 
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Equation (8.139) can be used as a motivation for the following definition of the 
S/St -derivative for a general variant T' defined along the trajectory y ( f): 

ST' dP , , 

ir = -dr + VJ V (8 - 140) 

This definition can be extended to tensors of arbitrary indicial signature as follows: 


ST]_ 

St 


dT 

dt 


1 + v k r 1 


j/k T~*m ti 

V V kj L m * 


(8.141) 


The S/St -derivative can also be referred to as the intrinsic derivative. 

Exercise 152. Suppose that T' is the contravariant component of a vector field T 
that is constant along the trajectory y. Show that ST' /St = 0. 

This exercise gives insight into the term parallelism. If a vector T travels 
unchanged along the trajectory T then its components T' , with respect to the 
changing basis Z;, are characterized by the equation ST' /St = 0. 

The S/St -derivative has all of the desirable properties discussed in chapter. The 
following exercises summarize these properties and invite you to use the newly 
developed calculus to obtain the expression for components of the jolt J. 

Exercise 153. Use the techniques developed in this chapter to show that the 5/St- 
derivative satisfies the tensor property: that is, the 8 / 8 1 -derivative produces tensors 
out of tensors. 

Exercise 154. Show that the 8/ St -derivative coincides with the ordinary derivative 
d / dt when applied to variants of order zero. 

Exercise 155. Show that the S/S t -derivative coincides with the ordinary derivative 
d/dt in affine coordinates. 

Exercise 156. Show that the 8/ St -derivative satisfies the sum and product rules. 
Exercise 157. Show that the 8/ St -derivative commutes with contraction. 

Exercise 158. Suppose that Tj is defined in the entire Euclidean space. Show that 
the trajectory restriction of Tj satisfies the chain rule 


STj dTj (t, Z) 
St dt 


(8.142) 


Exercise 159. Show that the S/S t -derivative is metrinilic with respect to all metrics 
associated with the coordinate system A' . 

Exercise 160. Rewrite equation (8.138) in the form 
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( 8 . 143 ) 


Apply the 8 / 8 1 -derivative to both sides of this equation to obtain the following 
expression for the jolt J: 



( 8 . 144 ) 


Thus 


( 8 . 145 ) 


St 2 ' 


8.10 Summary 

In Chap. 6, we discussed the crucial importance of the tensor property in forming 
geometrically meaningful invariant objects. However, we discovered that partial 
differentiation of a tensor results in a variant that is not a tensor. This problem had 
to be overcome in order to preserve the tensor calculus framework. The solution 
presented in this chapter comes in the form of a new differential operator, the 
covariant derivative. 

The covariant derivative truly saves the day! It preserves just about all important 
properties of the covariant derivative and provides the crucial benefit that it 
produces tensors out tensors. The invention of the covariant derivative allowed 
the development of tensor calculus to go forward. The definition developed in 
this chapter will serve as a prototype for other differential operators that produce 
tensors out of tensors, include the covariant differentiation on surfaces as well as 
the invariant time derivative on moving surfaces. 


Chapter 9 

Determinants and the Levi-Civita Symbol 


9.1 Preview 

I have been looking forward to writing this chapter because we now get to use the 
machinery we have been constructing. And to what great effect! Tensor calculus is a 
fantastic language for determinants. The subject of determinants is beautiful and is 
too often obfuscated by cumbersome notation. Even some of the most fundamental 
properties of determinants can be difficult to follow without tensor notation. If 
you ever found the proof that the determinant of a product equals the product of 
determinants less than satisfying, you will find your satisfaction in this chapter. 

You are well aware of the importance of determinants for solving linear systems, 
computing volumes, changing variables in volume integrals, and evaluating vector 
products and the curl of a vector field. In tensor calculus, determinants also play 
a special role. There is a kind of symbiotic relationship between determinants 
and our subject: the tensor notation makes working with determinants easy while 
determinants give us the Levi-Civita symbols and help us approach integration from 
an invariant point of view. 

We will begin with the permutation symbols. The determinant of a second- 
order system can then be defined in terms of the permutation symbols. We will 
then demonstrate the key properties of the determinant, in tensor notation. We 
will also introduce the delta symbols, which are nifty little objects of tremendous 
utility. The ultimate highlight of the chapter is the Voss-Weyl formula, which 
gives a convenient expression for the divergence, and therefore the Laplacian, that 
bypasses the covariant derivative. Finally, we will present the volume element and 
the Levi-Civita symbols, rounding out our collection of metrics. 
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9.2 The Permutation Symbols 

Our discussion will initially be limited to three dimensions. Consider a 
skew-symmetric system A^k. Skew-symmetry refers to the property that an 
exchange of any two indices changes the sign of the entry, whether it is the first two 
indices 


A ijk = —Ajik, 


(9.1) 


the last two 


Aijk = -Aikj, 


(9.2) 


or the first and the third 


Aijk = —Akji. (9.3) 

It immediately follows that if two of the indices have the same value, then the 
corresponding entry in Ajjk is zero. For instance, applying equation (9.1) to the 
values of indices i = j = 1 and k = 2, yields 

^4ii2 = — Am (9.4) 

and it follows Am = 0. We therefore conclude that the entries of A,jk are not zero 
only for indices that form a permutation of numbers 1, 2, and 3. There are 6 entries 

^4 123 1 A 132 , ^ 213 - ^ 231 - 4 312 , A 32 l, (9-5) 

but only one degree of freedom: each entry is either ^4 123 or — A 123 . These values are 
determined by the skew-symmetry conditions (9.1), (9.2), and (9.3). Let a denote 
A 123 . Then 


2 I 123 = ^312 = ^231 = a (9.6) 

^213 = ^321 = ^4 132 = —a. (9.7) 

As you can see, there isn’t much variety in skew-symmetric systems: once the value 
of A 123 is selected, the remaining five nonzero entries follow. When o' = I the 
system is denoted by etjk and is called a permutation symbol. Let us catalog its 
values in a format that generalizes to higher (and lower) dimensions 

( 1 if i jk is an even permutation of 1,2,3 

— 1 if ij k is an odd permutation of 1, 2, 3 . (9.8) 

0 otherwise 
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The same system can be denoted by the symbol e ,jk with upper indices. The 
placement of the indices does not matter until we discuss transformation of 
coordinates. 

Exercise 161. Show that equation (9.3) follows from equations (9.1) and (9.2). 
Exercise 162. Evaluate the contraction e ljk eijk- 
Exercise 163. Evaluate e' jk ekij . 

Exercise 164. What are the entries of the system f ji ; = e^k + ^ jik - 
Exercise 165. What are the entries of the system fjk = e^k + e/ a/ ? 


9.3 Determinants 


Consider a system a'j of order two. In this section we are not concerned with the 
variant or tensor properties of a'. . That is, it is unimportant — for now — according 
to what rule a'j was constructed. Consequently, the placement of indices does not 
carry any tensorial information and we picked an object with one upper and one 
lower index simply as a convenience. The determinant A of a'j is defined as 

A = e ljk ajajal. (9.9) 

You should convince yourself that this definition is consistent with the definition 
given in linear algebra. It is certainly more compact, but the compactness comes 
at the expense of including a multitude of terms that equal zero. The usual linear 
algebra definition includes 3! terms. Equation (9.9), on the other hand, has 3 3 terms, 
all but 3! of them zero. The surviving terms are the same as in the usual definition 
and the permutation symbol e ,jk assures that they are counted with the correct sign. 

Our first goal is to replace equation (9.9) with an equivalent one that conforms to 
all the conventions of tensor calculus: 


A = ^e' ]k e rs ta r i a s j a t k. (9.10) 

To show the equivalence of equations (9.9) and (9.10), note that switching the values 
1 and 2 in the original definition results in the same value of opposite sign: 


e iik a}a)a\ = e ijk a)afal 


e iik a)a)al 


-e ijk a)a)al 


(9.11) 


This observation can be applied to any pair of indices. In other words, the 
e'i k a\a S ja! k is skew-symmetric in r, s, and t. As we have previously learned, a 
third-order skew-symmetric system has a single degree of freedom, which can 
be identified as the entry corresponding to r,s,t = 1.2,3. By definition, for 
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these values of the indices, e l,k a\ a 5 -a l k is precisely A. This observation can be 
summarized as follows: 


e iik a r i a s j a t k = Ae rst . (9.12) 

This is already better since we have succeeded in eliminating explicit indices. 
What we need to do now is isolate A on the right-hand side. In Exercise 162 you 
discovered that e rst e rst = 3!. Therefore, contracting both sides of this equation 
with e rsl , we arrive at equation (9.10). From this point forward, we will treat 
equation (9.10) as the definition of the determinant. 

Definitions similar to (9.9) can be given for systems cijj with lower indices and 
a 1] with upper indices: 


A = e ,jk a u a 2 ja 3k (9.13) 

A = e ijk a h a 2j ci 3k . (9.14) 

and it can be similarly shown that 

A = ^ e l]k e rst a ir a js a kt (9.15) 

A = j [ e i j k e rst a ir a ]S a kt . (9.16) 


Once again, the placement of the index is only a matter of convenience until the 
transformation of A and underlying systems is analyzed. This will be discussed 
later in this chapter. 

Exercise 166. Explain each step in equation (9.11). 

Exercise 167. Derive equations (9.15) and (9.16). 


9.4 The Delta Systems 

The fullest delta system 8‘/ s k is defined as the tensor product of two permutation 
symbols 


8 


ijk 

rst 


e ijk e rst . 


(9.17) 


Using this new object, the determinant A of a'j can be written more compactly as 


A = jjK J st a i a j a k- 


(9.18) 
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Furthermore, by multiplying both sides with ei mn , equation (9.12) leads to the 
following expression 


x‘J k n r n s n t 

a imn a i a j a k 


A8[ 


rst 
mn ’ 


(9.19) 


which will prove important in showing the product property of the determinant. 
This equation is quite appealing, even if it doesn’t say anything beyond what we 
have already learned. Also, in a way, it may also be the most of all tensor identities 
we have experienced so far. After all, it represents 27 identities, each containing 729 
terms on the left-hand side. 

The symbol S'/ s k t is only one of a whole group of delta symbols S'., S l j s and S'/ k t . Of 
course, we have already encountered 81, the Kronecker symbol. All delta systems, 
including their generalizations to higher dimensions, can be captured by a single 
definition: A particular entry of a delta system has value 1 when the upper and 
lower are identical sets of distinct numbers related by an even permutation, — 1 if 
the sets are related by an odd permutation, and 0 otherwise. In other words, the 
entry is 0 if any of the upper or any of the lower indices repeat, or if they do not 
represent identical sets of numbers. You should be able to see that this definition 
correctly describes the familiar objects 81 and S'/ s \, for which alternative definition 
exist. 

Thus, the value is 0 if any upper or lower indices are equal and, in the case of 
distinct sets, if those sets are not the same. For example, 8\\ = 0 because the two 
upper indices are the same and S\i = 0 because the upper and lower indices do not 
consist of the same numbers. As a final example, < = 1 and 8f = — 1 because, in 
each case, the upper and lower indices form identical sets of distinct numbers. For 
(5j|, the upper indices are an even (zero, actually) permutation away from the lower 
indices while for Si] the permutation is odd. 

It is evident that delta systems are skew-symmetric in its upper and its lower 
indices. The full delta system S'JH can be expressed in terms of the Kronecker 
symbol by the determinant 


8 ijk 

u rst 


ft X* 

u r u s u t 

si si si 

si si si 


(9.20) 


The most important consequence of equation (9.20) is that S' Hi is a tensor. 
Consequently, according to equation (9.18), the determinant of a second-order 
tensor with one covariant and one contravariant index is an invariant. The same 
cannot be said of tensors with two covariant or two contravariant indices. The tensor 
properties of determinants of such tensors are discussed later in this chapter. 

Delta systems of different orders are related by the following contractions: 

% = €1 

2 8 \ = 


(9.21) 

(9.22) 
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Proofs of these relationships are left as exercises. Equations (9.21) and (9.22) show 
that all delta systems are tensors. These relationships extend to higher dimensions. 
For example, in four dimensions. 


C = 5 


2 sn = s 


| ijkl 

rstl 

(9.23) 

.ijk 

rsk 

(9.24) 

\ ij .. 

rj 

(9.25) 
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The symbol S'/ s can be expressed in term of the Kronecker symbol by the following 
frequently used identity 

#J s = &\8i-VX (9.26) 

The right-hand side of equation (9.26) can also be captured by a determinant: 


8 ,J = 


8 ' 8 ' 
r 

si si 


(9.27) 


Exercise 168. Justify equation (9.20). 


Exercise 169. Explain why equation (9.20) shows that 8' r { k t is a tensor. 

Exercise 170. Explain why it follows that the determinant of a tensor with one 
covariant and one contravariant index is an invariant. 

Exercise 171. Justify equations (9.21) and (9.25). 

Exercise 172. Justify equations (9.23) and (9.25) in four dimensions. 

Exercise 173. Generalize equations (9.23) and (9.25) to N dimensions. 

Exercise 174. Justify equation (9.26). 


Exercise 175. Evaluate 8; 


ijk 


Exercise 176. Evaluate S' 


ijk- 

ikj 

jir' 


Exercise 177. Evaluate 8:,, 8 


'jk orst 


rst u lmn ‘ 


Exercise 178. Evaluate S‘ r { k t : 8J£f n S'/J)" . 


9.5 A Proof of the Multiplication Property of Determinants 


The multiplication property of determinants reads 

\MN\ = \M\\N\. (9.28) 

In words, the determinant of a product of two matrices equals the product of their 
determinants. Three different proofs of this property can be found in linear algebra 
textbooks including [13,27,44], and [22], The tensor derivation is based on the 
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approach in [13], and [22]. However, the proofs without the tensor notation lead 
to cumbersome expressions, while the repeated use of the symbol ^ a ii permutations 
makes those proofs difficult to follow. 

The tensor notation translates the logic of permutations into simple algebraic 
manipulations. Suppose that the double system c‘- is the “matrix product” of a'j 

and b k : 


c ‘j =<*>)■ 


(9.29) 


Then its determinant C is given by 


C — —X ijk r r r s r t 
L °rst c i c j c k- 


Substitute equation (9.29) in (9.30): 

C — — 8 ijk a r h l a s h m n t h n 
C — y 0 rst a l o i a " 1 a n°k' 

Since, by equation (9.19), 8‘/ k t a s m a' n = AS'jjf , we have 


1 


C = -A8’ jk b'b’fbl, 

-^1 Imn i J k ’ 

which gives 

C = AB. 

The entire derivation can be combined in a single line 

um„t UK 


3! 


(9.30) 


(9.31) 


(9.32) 


(9.33) 


C = = U} k a\b\a s m b'y\bl = ±Ag£ H b\bJbt = AB. (9.34) 


This proof is a demonstration of the effectiveness of the tensor notation. 

Exercise 179. Show C = AB property for systems cl, a 1 ' , and bjk related by 
4 = a^b jk . 


9.6 Determinant Cofactors 

If the term a\ is removed from the product 8'J k t a] a s j a’ k the result is a second-order 
system of tremendous utility. Define the cofactor A‘ r by 


A i — _L K'i k s 1 

A r ^i°rst a j a k- 


(9.35) 
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For a tensor a , the cofactor A\. is also a tensor. The cofactor A\. has two remarkable 
properties. First, it represents the partial derivative dA/da' r of the determinant A 
with respect to its entries. Second, the product A~ l A' r is the matrix inverse of a\. 
To prove the first relationship 


ito? = A '- <9 - 36) 

compute the derivative of A with respect to a", since the indices i and r are already 
used in equation (9.35). We have 


dA 1 ijk d ( a W<) 

da 1 ; 3! TSt da " 


(9.37) 


By the product rule. 


— - 

da; ~ 3! TSt 


( da i s t r 9<3 y t r S da k 


(9.38) 


Each of the partial derivatives in (9.38) can be captured by the Kronecker symbol. 
For example, 


M - rtf 

da; 


(9.39) 


Therefore 


dA 

da; 



SMa’ai + Stfja'ai + S'Xa'a’). 


(9.40) 


Multiply out this expression and use the index-renaming property of the Kronecker 
symbol: 


dA_ 

da; 


1 (x 

ill*' 


^ j 1 bust ® 


U k „s t , silk r t 
i a k + O rll t a i a k 




The terms in parentheses are each equivalent to 2A l u . Therefore 


dA_ 

da; 



(9.41) 


(9.42) 


which completes the proof. 
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We next show that the cofactor A' r is proportional to the matrix inverse of the 
system a \ : 


Ala 1 ', = A8‘ 


(9.43) 


Denote the product A' r a' w by D' m : 


1 


D ' m = A Wm = jfis,<y j a t k- 

We first show that D' m = 0 when i ^ m. For example, consider D\ 


D\ = 


2 ! 


rst u 2 u j u k- 


(9.44) 


(9.45) 


Out of the 3 5 terms in the sum on the right-hand side, only two (/. k = 2, 3 and 
j, k = 3,2) correspond to nonzero entries of : 


nl 1 . el32 _r „s \ 

™ 2 ~ 2 \ \°rst a 2 a 2 a 3 + °rst a 2 a 3 a 2j • 


(9.46) 


However, each of these terms is zero because the value 2 appears twice as the lower 
index. Therefore, D' m = 0 for i ^ in. 

Next consider the case i = m. For example, consider D \ : 


nl — —8^ k n r n s n t 
u \ — 2\ rst a l a j a k- 


(9.47) 


The same two terms correspond not nonvanishing values of 8 

nl 1 /el23 ~s „t , o 1 32 

D 2 — \8,st a \ a 2 a 3 + °>-st a 2 a 2 a \) ■ 


1 jk. 


(9.48) 


This time, each equals A and we have therefore shown that D’ m = A for i = m. 
This completes the proof of the fact that A’ r is A times the inverse of a\ . 

Exercise 180. Explain equation (9.39). 

Exercise 181. Show that each term in equation (9.41 ) equals 2A". 

Exercise 182. Generalize the proof of equation (9.36) to arbitrary dimension. 

Exercise 183. Derive similar expressions for determinants of tensors and a lJ . 

Exercise 184. Explain why each term vanishes on the right-hand side of equa- 
tion (9.46). 


Exercise 185. Explain why each term in parentheses in equation (9.48) equals A. 
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Exercise 186. Extend the subject of cofactors to systems with two lower indices 
(denote the cofactor by ^4"' ) and systems with two upper indices (denote the cofactor 
by A ir ). 


9.7 The Object Z and the Volume Element 

Let Z denote the determinant of the covariant metric tensor Z, ; - : 


Z = |Z..| . 


(9.49) 


We have used the letter Z to denote several objects: the independent variable Z', 
the bases Z and Z' , the metric tensors Zy and Z' J , and now the determinant of Zy . 
However, there is no confusion since the index signatures and the font make these 
symbols different. Interestingly, some texts use the same letter to denote the position 
vector R, the Jacobian matrices Jl, and J‘ and the upcoming Levi-Civita symbols 
Sijk and s ,jk . The combination of index signature and font would still enable us to 
distinguish among all objects Z, Z',, zf , Zjjk, Z ljk , and allow us to label all of 
the most fundamental objects by the same letter. In this book, however, we limit the 
use of the letter Z and this is our last use for Z. Finally, note the use of dots .. in 
equation (9.49) to indicate the covariant metric tensor. We prefer to use this notation 
to the alternative Z = | Zy 1 , because in the latter, the indices i and j are neither 
live nor contracted and we do not employ indices in any other way. 

The variant Z is not a tensor. However it does transform according to a very 
interesting rule that makes it a relative tensor. Relative tensors are discussed in 
Sect. 9.9. Consider an alternative coordinate system Z' and the corresponding 


variant Z' 



(9.50) 


Our goal is to establish the relationship between Z' and Z. Let us remind ourselves 
of the relationship between Zy y and Zy : 



(9.51) 


Let the letter J denote the determinant of the Jacobian J], 


J = \ Jj . 


(9.52) 


Since Jl, and jf are the inverses of each other, the determinant of jf is J 1 
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From equation (9.51) we conclude, by the multiplication property of determinants, 
that 


Z' = Z/ 2 . (9.54) 

This equation confirms that Z is not a tensor. A variant that changes according to 
this rule is called a relative tensor of weight 2 (because of the 2 in the exponent). 
Despite not being a tensor, the variant Z is an object of utmost importance as will 
become evident in the remainder of this chapter. 

The quantity \[Z is called the volume element. In the two-dimensional plane, 
it may be referred to as the area element. On a line, it may be called the length 
element or the line element. When the dimension of the space is not specified, we say 
volume element. The quantity Z is the determinant of a positive definite matrix and 
is therefore positive. More accurately, it is the determinant of a positive semidefinite 
matrix and is therefore nonnegative. It may vanish at special points, such as the 
origin of a polar coordinate system. In either case, the taking of the square root is 
justified. 

You may be familiar with the term volume element from multivariable calculus, 
where it is defined with a reference to a Cartesian coordinate system. According 
to the multivariable calculus definition, the volume element in coordinates Z' is 
defined as the determinant J of the lacobian matrix , where Z' are Cartesian 
coordinates. The tensor definition has the advantage that it is given in absolute terms 
without a reference to a particular secondary coordinate system. 

The two definitions lead to nearly identical objects that may differ by a sign. 
Consider a Cartesian coordinate system Z' and an alternative coordinate system 
Z' . Then, according to equation (9.54), the object Z' in the primed coordinates is 
given by 


Z' = J 2 , (9.55) 

since Z = 1 in Cartesian coordinates. This is clear from equation (5.37) and is 
catalogued in equation (9.57). Taking square roots of both sides, we find 

V Z' = Absolute value (/) . (9.56) 

Therefore, the volume element \fZ' and the determinant J of the lacobian coincide 
when J is positive and are of opposite signs when J is negative. In other words, 
the volume element yfZ' agrees with the lacobian if the coordinate system Z' is 
positively oriented with respect to the Cartesian coordinates Z' . The orientation of 
a coordinate system was defined in Sect. 4.10. 

We now catalogue the values of the volume element for the most common 
coordinates. In Cartesian coordinates, 


>/Z = 1. 


(9.57) 
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In polar and cylindrical coordinates. 


VZ = r. 


(9.58) 


In spherical coordinates, 


\[Z = r sin 6. 


(9.59) 


Exercise 187. Confirm equations (9.57), (9.58), and (9.59). 

9.8 The Voss-Weyl Formula 

The magical Voss-Weyl formula for the calculation of divergence V, T 1 of a tensor 
field T l illustrates another use of the volume element \fZ\ 



(9.60) 


The Voss-Weyl formula expresses the divergence of a contravariant tensor field 
without a reference to the Christoffel symbols. It is therefore an interesting 
example of an invariant expression constructed in a nontensor way. Furthermore, 
it is a formula that has an attractive structure that is often recognized in various 
expressions. 

To confirm the Voss-Weyl formula, first apply the product rule: 



(9.61) 


Thus, we are therefore faced with the intriguing task of calculating the partial 
derivative 3\/Z/3Z'. The variant Z depends on the coordinates Z k via the entries 
of the metric tensor Z, ; - . Therefore, 3 Z / 3 Z k can be found by the chain rule 


3Z _ 3Z 3 Z u 
3 Zf ~ dZ~jJzf' 


(9.62) 


Recall equation (5.68) which gives the partial derivative 3Z,y /3 Z k \ 



(5.68) 


According to Exercise 186, the derivative of Z with respect to the entries of the 
metric tensor Z, ; is given by 
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9Z 

dZ~j 


zz ij . 


(9.63) 


Putting these equations together, yields 


9Z 

9Z^ 


= zz" (r iJk + Tj'ik ) , 


(9.64) 


or, equivalently, 


9Z 

dzt ~ 2ZTik - 


(9.65) 


The partial derivative 9 ~/Z /dZ k of the volume element \[Z is now computed by a 
simple application of the chain rule 


dVz _ 1 9Z 

”9Z*~ ~ 2 VzdZ 1 ' 


The final expression reads 


97z 

dZ k 


= Vzr;,. 


(9.66) 


(9.67) 


To complete the derivation of the Voss-Weyl formula, substitute equation (9.67) 
into (9.61) 


-^-^(Vzr) = f^ + r r|-, ( 9 . 68 ) 

*JZ 9Z' V / 9Z' **’ 

and note that the expression on the right-hand side coincides with the definition of 
V,- T‘ . 

The Voss-Weyl formula provides the most effective way to calculate the Lapla- 
cian V, V' F of an invariant field F. Since the Laplacian V, V' F is the divergence 
of T‘ = V' F and contravariant gradient V' F is given by 

.. dF 

V' F = Z' J - — r, (9.69) 

9Z^ 

the Voss-Weyl formula yields 

1 O / (^p 1 \ 

V, V' F = —= - — r ( x/zz' 7 — ) . (9.70) 

Vz 9Z' V dZJ ) 


Exercise 188. Use the Voss-Weyl formula to derive the Laplacian in spherical 
coordinates 


146 


9 Determinants and the Levi-Civita Symbol 


V; V'F 


1 3 / 2 3E\ 1 3 / . dF \ 1 3 2 F 

r 2 3 r \ dr ) r 2 sin 9 dd \ 3 9 ) + r 2 sin 2 6 dtp 2 


(9.71) 


This expression is equivalent to equation (8.39) which was obtained in a much 
more laborious fashion with the help of the Christoffel symbols. It is common 
not to expand any of the terms in this expression and to present the Laplacian 
in this appealing form. When looking at this expression, one can see upon it the 
characteristic imprint of the Voss-Weyl formula. 

Exercise 189. Use the Voss-Weyl formula to derive the expression (8.37) for the 
Laplacian in cylindrical coordinates. 


9.9 Relative Tensors 

The object Z is an example of a variant that, while not quite a tensor, transforms 
according to a very interesting rule given in equation (9.54). The transformation 
rule is nearly that of an invariant except for the additional factor of J 2 . There are 
other important objects that are nearly tensors in this sense. A variant T‘ is a called 
relative tensor of weight M if it changes according to the rule 

Tp = J M Tj jf jj,. (9.72) 

Thus, tensors are special cases of relative tensors corresponding to M = 0. To 
make the distinction clearer, tensors are sometimes referred to as absolute tensors. 
A relative tensor of order zero of weight M is called a relative invariant of weight 
M . As illustrated by the following exercises, relative tensors satisfy all of the 
appropriately adjusted properties of regular tensors. 

Exercise 190. Show that if S‘- k is a relative tensor of weight M and T t rs is a relative 
tensor of weight N , then Sj k T t rs is a relative tensor of weight M + N .In particular, 
if M = —N. then S‘ jk T[ s is a tensor. 

Exercise 191. Show that the result of contraction for a relative tensor of weight M 
is also a relative tensor of weight M . 

Exercise 192. Conclude that S‘J s k t is an absolute tensor on the basis of equa- 
tion (9.17). 

We now turn to the most interesting relative tensors and e‘ jk . Let us first look 
at the symbol and see what we get when we transform it as if it were a covariant 
tensor: 


eijkJt'jj'4- 


(9.73) 
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By equation analogous to (9.12), this expression equals Jej/j'k'- 

e ijk Jl,jp4 = Jeyyv. (9.74) 

Therefore, 

eyyv = J- X e ijk J\,j],Jl, (9.75) 

which leads to the conclusion that ey* is a relative covariant tensor of weight — 1 . 

What about e ljk l Interestingly, since the entries of e ljk are identical to those of 
e ijk , it can be viewed as the same kind of relative tensor as e ijk . On the other hand, 
let us try to interpret e' //c as some kind of contravariant tensor. Let us therefore 
transform e ljk contravariantly: 


eV'Ji’jfj?. 


According to equation (9.12), we find 


eVbjfjfjf = J-VW, 


or 


/J' k ' = Je ijk jj' J j' J k ' 


(9.76) 


(9.77) 


(9.78) 


and we can therefore interpret e‘ jk as a relative contravariant tensor of weight 1. 

These properties of ey* and e ,jk can help us determine the similar properties 
for determinants of second-order tensors. Suppose that ay is an absolute covariant 
tensor. Its determinant A is given by equation (9.15) and is therefore a relative 
invariant or weight 2. Similarly, the determinant A of an absolute contravariant 
tensor is a relative invariant of weight —2. Finally, as we have seen previously, the 
determinant of an absolute tensor with one covariant and one contravariant index is 
also an absolute tensor. 

Exercise 193. Show that the determinant of a relative covariant tensor ay of weight 
M is a relative invariant of weight 2 + 3 M and that in n dimensions the expression 
generalizes to 2 + n M . 

Exercise 194. Show that the volume element \/Z is a relative invariant of weight 
1 with respect to orientation-preserving coordinate changes. 

We now have all the machinery to introduce the Levi-Civita tensors — this 
author’s favorite objects. 
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9.10 The Levi-Civita Symbols 

Our goal is to adjust the relative tensors e ljk and eyt to produce absolute tensors. 
This can be accomplished by scaling these systems by the volume element -Jz 
which is a relative invariant (with respect to orientation-preserving coordinate 
changes) of weight 1. The result is the Levi-Civita symbols e' ji and ey*: 


s U k - e 

Vz 

(9.79) 

£/ jk — &ijk • 

(9.80) 


The Levi-Civita symbols are absolute tensors with respect to orientation-preserving 
coordinate changes. As absolute tensors, Ihe Levi-Civita symbols can be used 
effectively in defining the curl operator and the cross product of vectors. The Levi- 
Civita symbols are considered to be metrics — a special group of objects derived 
from the position vector R. The metrics now include the covariant and contravariant 
bases, the metric tensor, and now the volume element and the Levi-Civita symbols. 
The introduction of the Levi-Civita symbols completes the list of metrics. 

In Chap. 8, we discovered the metrinilic property of the covariant derivative. 
That is the fact that each metric vanishes under the covariant derivative. Does this 
property extend to the Levi-Civita symbols? The answer is yes and proving this 
property is the subject of the next section. 

Exercise 195. Confirm that the Levi-Civita symbols are absolute tensors with 
respect to orientation preserving coordinate changes. 

Exercise 196. Show that 


C, = e ijk e rst , (9.81) 

which offers yet another proof that S'/ s k t is an absolute tensor. 

Before we discuss the application of the covariant derivative to the Levi-Civita 
symbols, we point out an important notational point. Note that the symbol £,-,■£ 
admits two interpretations. On the one hand, it is defined by an explicit expression 
in equation (9.80). On the other hand, it can be interpreted as the result of lowering 
each index of the contravariant Levi-Civita symbol e rst : 

e ijk = e rst Z ir Z js Z kt . (9.82) 

Do these two distinct interpretation result in the same object? Fortunately, they do. 
Let us take the expression e rst Z, r Z j S Z kt as the starting point and substitute the 
definition of the Levi-Civita symbol e rst : 

P rst 7 .7 7 , , 

r rst ^7 r 7 ’y 11 ^ J s kt 

b ZjjfZjjsZj/ct — . — 


(9.83) 
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By equation (9.12), the expression in the numerator equals thus 


e rst Z ir Z is Z kt = 


Ze ijk 

~7z 


= VZe,- 


-jk- 


(9.84) 


We have therefore arrived at the definition of ey* showing that the two interpreta- 
tions are equivalent. 

Exercise 197. Show that the permutation e ljk and e kjk are not related by index 
juggling. 


9.11 The Metrinilic Property with Respect 
to the Levi-Civita Symbol 

Does the metrinilic property extend to our newest metrics, the Levi-Civita symbols? 
That is, do the Levi-Civita symbols vanish under the covariant derivative? The 
answer is yes and this section is devoted to proving this important property — the 
long way. Of course, there is the standard short argument that relies on the Euclidean 
nature of our space. Simply observe that V, s rst is a tensor that vanishes in Cartesian 
coordinates. Therefore, it vanishes in all coordinates, Q.E.D. 

The long way is a valuable technical exercise. It entails a direct application of the 
covariant derivative to the definition of the Levi-Civita symbol. It has the important 
advantage that it will continue to work on curved surfaces. 

We first evaluate V, e rs( , that is, the covariant derivative applied to the permuta- 
tion symbol. Note that, despite the fact that the entries of e rst are spatial constants, 
there is no reason to expect that the covariant derivative of e rst vanishes. As we 
discussed in Chap. 8, this property of the partial derivative does not extend to the 
covariant derivative (nor should it). 

Introduce the symbol, Tj rst : 


Tirst = V,c m . (9.85) 

Since their partial derivative de rst / dZ' vanishes, T irst consists of the following three 
terms 


Tirs, = -r ™e ms t - T? s e rm t ~ K t e rsm . (9.86) 

From this expression we can conclude that T\ rst is skew-symmetric in the indices 
rst. Swapping r and s, we find 

Tisrt — ^i@srt — ^i^rst — T\ rs t, 


(9.87) 
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confirming the skew-symmetry. Therefore, we need to consider only 7} 123 which is 
given by 


T im = -r>,„23 - r™e lm3 - r; n 3 e, 2 m- (9.88) 

In each contraction, only one term survives: 

Ti 123 = — r.\ e 1 23 — r i 2 2 e 12 3 — r,^ 12 3 . (9.89) 

Factor out ei 23 and summarize this identity using a contraction: 

7)123 = -r m V 123 . (9.90) 

This argument can be generalized for other permutations of 123, yielding 

Tirst = -r* (9.91) 

The metrinilic property of s rst can now be demonstrated. By the product rule, 

V, s rst = V,- = -^r-e rst + VzV,e m . (9.92) 

We have calculated the partial derivative dyfZ/dZ' previously and the result be 
found in equation (9.67). Combining equations (9.67) and (9.91), we find 

V,- Erst = Km Crs, ~ Ki * rst = 0, (9.93) 


and the proof is complete. 

Exercise 198. Show that the metrinilic property applies to the contravariant Levi- 
Civita symbol s rst . 

Exercise 199. Use the metrinilic property of the Levi-Civita symbols to show the 
same for the delta systems: 

V,„C, V B «y, V m S' r =0. (9.94) 


9.12 The Cross Product 

The Levi-Civita symbols, being tensors, are key to the analytical definitions of the 
cross product and the curl. The cross product W' of two vectors U' and V is 
defined by 


W' = s ijk UjV k . 


(9.95) 
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Since all the elements on the right-hand side of (9.95) are tensors, W' is a tensor as 
well. To form an invariant, contract W' with the covariant basis Z, to form W = 
W'Zj. Then 


W = e ijk UjV k Zi. (9.96) 

In dyadic notation, the relationship among U, Y, and W is denoted by 


W = U x Y. (9.97) 

We are now thoroughly enjoying the power of tensor calculus! We wrote down 
equation (9.95) with great ease, confident that we produced the right equation. 
Although this definition is algebraic, we are assured of its geometric meaningful- 
ness. Why? — because the result is a tensor! Furthermore, equation (9.95) gives us a 
whole lot more, including an explicit algorithm for constructing the gradient in any 
coordinate system. (The definition given in most calculus books certainly does not 
give us that.) 

Furthermore, the definition (9.95) gives us an expression that can be further 
manipulated. For example, let us evaluate the divergence V, W' of the cross product. 
By the product rule and metrinilic property, the Levi-Civita symbol e ljk passes 
through the covariant derivative V,- 

V,- W' = e ijk V,- ( Uj V k ) - (9.98) 

Subsequently, by the product rule, 

V,- W' = e ijk V, Uj V k + s ijk Uj V,- V k . (9.99) 

This expression does not admit any further analysis and constitutes the final answer. 
The combination e' j/ ‘ V,- Uj is known as the curl of Uj and is the subject of the 
following section. 

Just as easily, all the other fundamental properties follow from the defini- 
tion (9.95). First, let us show the antisymmetric property of the cross product. It 
follows instantly from the skew- symmetric property of the Levi-Civita symbol. Let 

W{=s ijk UjV k (9.100a) 

Wl=e ijk VjU k . (9.100b) 

Then, by a combination of steps, that we may call switch-rename-reshuffle, we have 

w; = e ijk UjV k = —s ikj Uj V k = -e ijk U k Vj = -s ijk VjU k = -W‘, Q.E.D. 

(9.101) 

In dyadic notation 


U x V = -V x U. 


(9.102) 
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The antisymmetric property can be expressed by the attractive equation (9.102) 
in dyadic notation. It may appear as a disadvantage of the tensor notation that there 
is no such expression in tensor notation. In fact, in tensor notation there is not even 
a symbol for the cross product. However, we see this as an advantage rather than as 
a disadvantage. On the one hand, we save yet another symbol that helps prevent our 
framework from being overwhelmed by the multitude of operators (not that we are 
in any danger) and we are also able to view the anti-symmetry of the cross product 
as a near triviality, given the skew-symmetry of the Levi-Civita symbol, rather than 
as a stand-alone property of a novel operator. 

The cross product of U l and V J is orthogonal to each of these vectors. Indeed, 
dotting W' and V , we find 


W‘Vi = e ijk UjUjV k = 0, 


(9.103) 


which also follows from the skew-symmetric property of the Levi-Civita symbol 
and can be shown by a switch-rename-reshuffle. This is a fundamental and familiar 
property of the cross product. We have obtained it as a consequence of the algebraic 
definition (9.95). Historically, it arose geometrically, where orthogonality was the 
key element of the definition. This is an instance, where tensor calculus inverts the 
traditional perspective and gives preference to the algebraic viewpoint. 

Exercise 200. Show that s ,jk U, Uj V k = 0 by a rename-switch-reshuffle. 

Next, let us calculate the length of the vector W' . It is an excellent technical 
exercise for working with Levi-Civita symbols. The length squared of the vector 
W‘ is given by Wj W' : 


WiW‘ =e ijk U i V k s irs U r V s . 


(9.104) 


Combine the two Levi-Civita symbols into a single 8- symbol: 


W i w i = 8)Y k U j V k U r V s . 


(9.105) 


By equation (9.21), 


it, if" = sy k u j v k u r v s . 


(9.106) 


and by equation (9.26), 



(9.107) 


Continuing, we find 


Wi W i = U j V k Uj V k -U j V k U k Vj . 


(9.108) 
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Rewriting this equation in a more attractive form yields 

Wi W = Uj U j V k V k - Uj V j V k U k . (9. 109) 

This is the final answer. In dyadic form, it reads 

|W| 2 = |U| 2 |V| 2 - (U • V) 2 . (9.110) 

Exercise 201. Show that |W| = |U| |V| sin a where sin a is the angle between U 
and Y. 

Exercise 202. As practice with working with Levi-Civita symbols, derive the 
expression for the cross product of three vectors. You should obtain an expression 
with a dyadic form that reads 

(U x V) x W = (U- W)V- (V- W)U. (9.111) 

Conclude that the cross product is not associative. 

Exercise 203. Show that the indices can be rotated in the expression for W to yield 

W = e ijk UjVjZ k . (9.112) 

Exercise 204. Show that 

U • (V x W) = (U x V) • W. (9.113) 

Exercise 205. Show that in Cartesian coordinates, the cross product is given by 

i j k 

W = U l U 2 U 3 (9.114) 

V 1 V 2 V 3 

Exercise 206. Show that in cylindrical coordinates, the cross product is given by 

rZ 1 r Z 2 rZ 3 

W = U 1 U 2 U 3 . (9.115) 

v 1 V 2 V 3 

Exercise 207. Derive the expression for the cross product in spherical coordinates. 
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9.13 The Curl 

The curl is an invariant differential operator which can be defined with the help 
of the Levi-Civita symbol. The term curl was coined by J. Clerk Maxwell in his 
celebrated Treatise on Electricity and Magnetism [30], Given the tensor field Uk, its 
curl V is defined as 


V i =e iJk V j U k . (9.116) 

In dyadic notation, the curl is denoted by a new symbol Vx: 

V = V x U. (9.117) 

As an illustration, let us analyze the curl of a curl of a vector field. In dyadic 
notation, let V = V x V x U. Then 

Vr = ersiV’(j Jk VjU k ). (9.118) 

By the metrinilic property of the Levi-Civita symbol, 

V r =e rsi e ijk V s VjU k . (9.119) 

Combine the Levi-Civita symbols in to a single 5-symbol 

v r = tf*V>V j U k , (9.120) 

and cycle the contravariant indices of the 5-symbol to put the index i in the last 
position: 


Vr=8 J r *y s VjU k . (9.121) 

The rest of the analysis parallels that of the example in preceding section. By 

equations (9.21) and (9.26) we have 

V r = 8 J r S*V s VjU k - 5*5/V s V,-C4, (9.122) 

which yields the near-final expression 

V r = V k V T U k ~ V s V s t/ r . (9.123) 

Renaming indices and switching the order of the covariant derivatives, we arrive at 
the final identity 


Vi = ViVjU J -WjW J Ui. 


(9.124) 
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In dyadic notation, this identity reads 


V x V x U = V (V ■ U) - V • VU. 


(9.125) 


Exercise 208. Analyze the curl applied to a cross product. Show that W = V x 
(U x V) is given by 


W' = Vy U J V + Vy U' V 1 - U J Vy V + U’ Vy V 1 . 


(9.126) 


Although there is no good way to do so express this relationship in dyadic notation. 
Exercise 209. Show that divergence of curl vanishes. 

Exercise 210. Show that curl of a gradient vanishes. 


9.14 Generalization to Other Dimensions 

The ideas presented so far in this chapter naturally generalize to any number of 
dimensions. We spell out the generalization to two dimensions and outline the 
generalization to higher dimensions. 

Suppose that all indices can now have values 1 or 2. The permutation symbols 
e,y and e ,J are defined analogously to equation (9.8) and therefore have the values 
captured by the following matrix 


The full delta system 8' r J s is defined by 



(9.128) 


and satisfies the properties 
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and 


<$'• = S‘/ r , (9.130) 

where the Kronecker symbol <51 has the usual definition. 

The determinant A of a 92 x 2 system a'j is given by 

A=U i /Xa s j , (9.131) 

and similarly satisfies the product property. 

The definition (9.49) of the object Z is valid in all dimensions. Its square root 
\/Z is the volume element which, in two dimensions, could be referred to as the 
area element. However, we will stick to the term volume element and reserve the 
term area element for embedded surfaces in Part II. 

The Levi-Civita symbols e ;; and e' 7 are defined according to the formulas 


Sij — Vz.eij 


Sz' 


The cross product involves a single tensor U ' : 

Vi = SijU j , 

and produces a vector that is orthogonal to U J , since 

V,U‘ = e ij U i U j = 0. 


(9.132) 

(9.133) 


(9.134) 

(9.135) 


Exercise 211. Explain why e,y U'U J = 0 in equation (9.135). 

Exercise 212. Show that the length of the vector V equals the length of the 
vector U' . 

The divergence and the Laplacian operators are defined identically in all 
dimensions, and the Voss-Weyl formula is universal, as well. The curl of a vector 
field U' is the scalar field V given by 

V = e u VU j . (9.136) 

Next, let us discuss generalization to a general n -dimensional case. The permu- 
tation symbols e and e"'" ,n have n indices and yield the value 1 if all indices 
are different and form an even permutation, —1 if odd permutation, and 0 otherwise. 
The full delta system <5 is defined by 


e jvjn > 


( 9 . 137 ) 
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and can be expressed by the determinant 


8 


h-in 

jl"'jn 


s h 


S n 


V} 

Jn 


S'" 

Jn 


(9.138) 


The delta systems are related by contractions and a coefficient that equals the 
number of missing indices: 

8 'jv"jk-i = ( n ~ k) ( note contract i° n on fit)- (9.139) 

In particular, 

s;;::;;* = k\ ( 9 . 140 ) 


The determinant A of an n x n matrix a 1 - is given by the equation 


The cross product involves n — 1 vectors U^, ■ ■ ■ U[„_d 

Vi=e ijv .j H _ 1 U*,-Uj;il ) (9.141) 

and the curl is not defined. 


9.15 Summary 

This chapter offered us a welcome opportunity to use the tensor framework 
developed over the preceding chapters. The object of our study here was the 
determinant. In the course of our discussion, we introduced the 5-symbols, which 
proved to be tensors of tremendous utility. We proceeded to define the determinant 
in tensor notation. The tensor notation proved to be a natural turn to expresses the 
determinant and enabled us to offer simple proofs of the key properties, including 
the one-line proof of the product property \AB\ = A B in equation (9.34). 

The Levi-Civita symbols s ljk and ey* were the stars of the second half of this 
chapter. Along with the volume element \[Z, the Levi-Civita symbols complete the 
list of metrics. We demonstrated that the Levi-Civita symbols are tensors and that 
the metrinilic property applies to them. Finally, we used the Levi-Civita symbols to 
introduce the vector product and the curl. 


Part II 

Tensors on Surfaces 



Chapter 10 

The Tensor Description of Embedded Surfaces 


10.1 Preview 

We have finally arrived at the subject of surfaces. This topic is extraordinarily rich 
and it lets tensor calculus shine at its brightest. We will focus on two-dimensional 
surfaces in three-dimensional Euclidean spaces. The Euclidean space in which the 
surface is embedded is called the ambient space. The analysis presented in this 
chapter is easily extended to hypersurfaces of any dimension. A hypersurface is 
a differentiable (n — 1 )-dimensional subspace of an //-dimensional space. That 
is, a hypersurface is characterized by a codimension of 1, codimension being the 
difference between the dimensions of the ambient and embedded spaces. Curves, 
the subject of Chap. 13, embedded in the three-dimensional Euclidean space have 
codimension two. 

A useful term that applies to subspaces of any codimension is manifold. The 
meaning of this term, originally introduced by Henri Poincare in Analysis Situs, 
has undergone an evolution in recent decades. Its currently accepted definition is 
given in the context of topology. We adopt a narrower meaning that is closer to 
Poincare’s original definition: a manifold is a subspace of the Euclidean space 
that can be parameterized by smooth functions defined on a region of M m , as in 
equations (10. la)— (10. lc). For m = 1, the manifold is a curve. For m = n — 1, the 
manifold is a hypersurface. 

This chapter focuses on the tangent and normal spaces that exist at all points on 
the surface where the surface is sufficiently smooth. For a hypersurface, the tangent 
space is ( n — l)-dimensional and is spanned by a set of /? — 1 vectors that form 
surface covariant basis. The normal space is one-dimensional and is spanned by a 
single vector N called the normal. 
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10.2 Parametric Description of Surfaces 


There are various ways to specify a surface. We describe surfaces parametrically. 
That is, we choose two variables S 1 and S 2 (collectively, S a ) and let each of the 
ambient coordinates Z 1 , Z 2 , and Z 3 depend on S a : 

Z 1 = Z 1 (5',S 2 ) 

(10.1a) 

z 2 = z 2 (s\s 2 ) 

(10.1b) 

z 3 = z 3 (s 1 , s 2 ) . 

(10.1c) 

For example, if the ambient Euclidean space is referred to Cartesian coordinates, 
then the equations 

x ( 9 , <p) = A sin 9 cos f 

(10.2a) 

y ( 9 , <p) = B sin 0 sin^> 

(10.2b) 

z(9,(p ) = C sin0 

(10.2c) 


describe an ellipsoid with semiaxes A, B . and C . The coordinate lines for this 
parameterization are shown in Fig. 10.1. 

The variables S a are called surface coordinates. We use letters from the Greek 
alphabet for surface indices, which assume values from 1 to m. We continue to 
use the Latin alphabet for ambient indices which change from 1 to 77 . Most of the 
equations in this chapter apply for arbitrary n and m and will point out when that is 
not the case. Our primary interest is two-dimensional surfaces (in = 2) embedded 
in three-dimensional Euclidean spaces (n = 3). 


Fig. 10.1 The surface of an 
ellipsoid referred to in 
spherical coordinates, a 
natural choice 
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10.3 The Fundamental Differential Objects on the Surface 

Rewrite equations (lO.la)-(lO.lc) in tensor notation 

Z ! = Z' (5) , (10.3) 

where we enumerated the ambient variables and suppressed the surface index of the 
function argument. 

The important role of the shift tensor becomes evident when we introduce the 
covariant basis S„. It is constructed in a way that is analogous to the ambient 
covariant basis Z, , the definition of which is given in equation (5.2). Consider the 
position vector R as a function of surface coordinates 5“: 

R=R(5). (10.4) 

The covariant basis S„ is defined by partial differentiation 

3R 

S “ = 35“ ‘ (1 °- 5) 

The geometric interpretation of S„ is straightforward: these vectors are tangential 
to the surface 5; more specifically, they are tangential to the coordinate lines 
corresponding to S 1 and S 2 . The plane spanned by the vectors S„ is called the 
tangential plane (Fig. 10.2). 

There is a natural connection between the surface and ambient bases. It comes 
from the identity 


R(5) = R(Z(5)), (10.6) 

which states this obvious fact: if we consider R as a function of the ambient 
coordinates Z' and substitute into that function the equation of the surface (10.3), 
then the result of the composition is the position vector R as a function of the surface 
coordinates S a . Differentiate the identity in equation (10.6) with respect to S": 


Fig. 10.2 A curved surface, 
its coordinate lines, and a 
covariant basis at one of the 
points 
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9R _ 9R 9Z' 
95 “ “ 9 ^ 95 “’ 


(10.7) 


where 9Z' /95“ are the partial derivatives of the equation of the surface (10.3). 
These derivatives are denoted by the symbol Z' a known as the shift tensor. 


7' 


9Z '' 
95“ 


With this new symbol, equation (10.7) can be rewritten as 

s a = Z'Z,, 


(10.8) 


(10.9) 


Thus, the shift tensor relates the surface and the ambient basis. Equation (10.9) is 
only one of out many important relationships in which the shift tensor plays a key 
role. The shift tensor is prominent is all identities that relate ambient and surface 
objects. 

We see from equation (10.9) that the entries of the shift tensor Z' a are the 
components of the surface covariant basis vectors S„ with respect to the ambient 
basis Z, . Dotting both sides of this equation with 7J (and the renaming j — »• i ), 
we obtain an equation that is equivalent to (10.9) and gives an explicit expression 
for Z': 


Z' = S„-Z'. (10.10) 

It may be said that the shift tensor represents the tangent space. Indeed, suppose 
that a vector T lies in the tangent space and has components T ' : 

T=r'Z ; . (10.1 la) 

The tensor T' is called the ambient components of vector T. Then the system T‘ 
can be represented as a linear combinations of the systems Z\ and Z' 2 , and that is 
the sense in which Z' a represents the tangent space. 

Let us demonstrate why the components T‘ of any tangent vector T can be 
represented by a linear combination of Zj and Z\. By definition, T can be 
represented by a linear combination of the surface covariant basis 

T = T a S a . (10.12) 

The tensor T a represents the surface components of the vector T. Substitute 
equation (10.9) into equation (10.12): 


T = T a Z' Z j 


(10.13) 
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In combination with (10.11a), this equation leads to the conclusion that 

V = T a Z l a (10.14) 

which shows T l as a linear combination of Z\ and Z' 2 and indicates that the coeffi- 
cients of this linear combination are the surface components of T. Equation (10.14) 
also justifies the name of the shift tensor: it translates or shifts components of tangent 
vector from surface to ambient. 

This shifting works in both directions. We show below that the following identity 
holds 


T° = r Z“. (10.15) 

However, we have not yet developed all the machinery to use the shift tensor Z“ 
with an upper surface index because we have not yet introduced the surface metric 
tensor. We now turn to the task of developing the metric tensors. 

The surface covariant metric tensor S a p is defined analogously to the ambient 
metric tensor: 


Sal J = S„-S„. (10.16) 

In words, the entries of the metric tensor are the pairwise dot products of the 
covariant basis elements. Compare equation (10.16) to the analogous definition 
of the ambient covariant metric tensor in equation (5.7). Therefore, by similar 
arguments, the covariant metric tensor is symmetric 

Safi = Sfto, (10.17) 


and positive definite. 

Naturally, the ambient and the surface covariant bases are related by the shift 
tensor. Equation (10.9) helps derive that relationship: 

Sap = S„ ■ S P = Z, Z' • Z jZ J p = Z :j Zl zj,. (10.18) 

In brief, 

Sap = Z ij Z i a Z J p . (10.19) 

By letting the metric tensor lower one of the ambient indices, we may rewrite 
equation (10.19) as 


Safi — Z ia Z p . 


( 10 . 20 ) 
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The contravariant metric tensor S 01 ^ is defined as the matrix inverse of S a p : 

S^S/ty = 8 a Y , ( 10 . 21 ) 

where <5“ satisfies the usual definition of the Kronecker symbol. The contravariant 
metric tensor is symmetric and positive, definite and much of what could be said of 
the ambient metric tensor can also be said of the surface metric tensor. Now that we 
have the covariant and the contravariant metric tensors, we have rules for juggling 
surface indices. For instance, 


Z ifi = Z^S^. (10.22) 

Further, by raising the index a, we can rewrite equation (10.20) in the following 
beautiful way 


Z“Z‘ = 5“ (10.23) 

In other words, the objects Z“ and Z^ are matrix inverses of each other in the sense 
of contraction by the longer dimension. 

Exercise 213. Explain why it is not possible that Z' a Z“ = S'j. 

Exercise 214. Derive equation (10.15) from equation (10.14). 

The rest of the surface metrics are defined analogously to their ambient counter- 
parts. The object S is defined as the determinant of the covariant metric tensor 

S = \S..\ (10.24) 

and its square root s/~S is the area element , by analogy with the volume element 
\[Z in the ambient space. The Levi-Civita symbols are defined according to the 
formulas 


s a p — v/i S e a p 

(10.25a) 

e a P 

F <*P - e 

Vs’ 

(10.25b) 


where e a p is the skew-symmetric system with nonzero entries 

Cl2 = — C21 = 1. (10.26) 

The surface 5-symbols are also defined analogously to their ambient counterparts. 


10.4 Surface Tensors 
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10.4 Surface Tensors 

In what sense is Z' a a tensor? We are working with two simultaneous systems of 
coordinates, ambient coordinates Z', and surface coordinates 5“. Both choices are 
arbitrary and we must consider the transformations of variants under changes of 
both coordinate systems. Suppose that Z' and 5“ are two alternative coordinate 
systems. The ambient coordinates Z' and Z' are related by equations (4.63) 
and (4.64) and the surface coordinates 5“ and 5“ are related by the following 
mutually inverse relationships: 


5“' = 5“' (5) 
5“ = 5“ (5') . 


Define the Jacobian objects and /“ by 

ds a ' (S) 


= 
J a 

I 


35“ 

35“ (5') 
35“' ' 


(10.27) 

(10.28) 


(10.29) 

(10.30) 


We now define the tensor property for surface variants. Consider a variant T'“ with 
a fully representative collection of indices. Suppose that its values in the alternative 
coordinate systems is Then Tj ^ is a tensor if 


' fi'a ' 7 not ji' jj ra' tP 

V ~ I jP J i J P r ’ 


(10.31) 


and it is clear how this definition should be modified for other index signatures. 

Let us demonstrate that the shift tensor Z‘ a is indeed a tensor. Its entries Z' , in 
the alternative coordinates are given by the partial derivative 


Z' 


3Z''(5') 

35“' 


(10.32) 


where Z 1 ' (S') is the parametric representation of the surface in the alternative 
coordinates. The three functions Z'' (5') can be constructed by the following double 
composition 

Z'" (5') = Z'' (Z (5 (5'))) , (10.33) 

where Z 1 ' (Z) gives the new ambient coordinates in terms of the old ambient 
coordinates, Z (5) represents the surface in old coordinates and 5 (5') gives the 
old surface coordinates in terms of the new surface coordinates. By a repeated 
application the chain rule, we have 
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or 


9Z _ az ,v 9Z' 9S“ 
9Z0 7 ~~ 9Z^95“ 95^’ 


(10.34) 


7' — 7' T‘ 

— ^ a J i J u'’ 


(10.35) 


and the proof of the tensor property of the shift tensor is complete. 

At this point, we only need to mention that the entire contents of Chap. 6 carry 
over naturally to surface tensors. We conclude that S a , S a p, S a P, s e a p are tensors 
and that S, e a p and e are relative tensors. 


10.5 The Normal N 


The covariant basis S„ and its ambient components captured by the shift tensor Z‘ a 
are analytical representations of the tangent plane. The normal direction is captured 
by the normal N with components N 1 . The normal vector N is defined by the 
identities 


N • S„ = 0 (10.36a) 

N • N = 1, (10.36b) 

where the first equation states that the normal is orthogonal to the tangent plane 

and the second equation states that the normal is unit length. These equations are 

satisfied by two equal and opposite vectors and the choice of normal depends on the 
particular problem. For closed surfaces, the exterior normal is often chosen. 

Denote the contravariant components of the normal vector by N‘ : 

N = N'T,. (10.37) 

We will later establish the following explicit expression for N ' 

N‘ = l -s iJk e a pZ-Z p k . (10.38) 

z 

However, this expression is not needed to derive most of the relationships involving 
N‘ . For example, equation (10.36a) implies 

N i Z i a = 0. (10.39) 

To show this, we write 

0 = N • S a = N j Zj ■ Z' Z, = N J ZyZij = Nj Z' a . (10.40) 

Similarly, equation (10.36b) implies 


NiN' = 1 . 


(10.41) 
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10.6 The Normal and Orthogonal Projections 

The normal projection of a vector Y is a vector P that point along the normal N and 
has the length such that the difference Y — P is orthogonal to N. From geometric 
considerations, it is evident that the length of P equals V • N and therefore 

P = (V • N) N. (10.42) 

If the geometric reasoning does not convince you, the validity of equation (10.42) 
can be established algebraically. 

Exercise 215. Show that (V — P) ■ N, where P is defined by equation (10.42), is 
zero. 

In component form, equation (10.42) reads 

P = NjN'Zj. (10.43) 

Therefore, the components of P' of P are given by 

P^^NjVK (10.44) 

The tensor 

N' Nj (10.45) 

is therefore the projection operator onto the normal N. In alternative terminology, it 
is the projection away from the surface S . 

Exercise 216. Denote the tensor N' Nj by Pj . Show that 

P)Pl = P l k . (10.46) 

In other words, a repeated application of this projection operator leaves the 
projection unchanged. In matrix notation, this property reads P 2 = P . From this 
form, it is evident that all eigenvalues of P or either 0 or 1 . 

We now turn to the orthogonal projection onto the surface S. The orthogonal 
projection of a vector V is the vector T in the tangent plane such that V — T 
is orthogonal to the tangent plane. The orthogonal projection T is given by the 
remarkably simple expression 


T= (V-S“)S„. 


(10.47) 


It is evident that T lies in the tangent plane. 
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Exercise 217. To show Y — T that is orthogonal to the tangent plane, show that 
(V — T) • = 0. 

In component form, equation (10.47) reads 

T = E y Z“Z^Z/, (10.48) 

which yields the component T l of the vector T 

V = Z i a Z a j V i . (10.49) 

Therefore, the tensor 

Z^Z“ (10.50) 

is interpreted as the operator of projection onto the surface S. 

The compactness of the expression Z^Z" is indeed remarkable. If you will recall 
from linear algebra, the problem of finding the orthogonal projection onto a linear 
subspace is solved by the technique of least squares [44]. If A is the rectangular 
matrix that has columns spanning the linear subspace (i.e., A corresponds to Z‘ a ), 
then the projection operator is given by the combination 

A(A T A)~ l A T . (10.51) 

Expression (10.50) includes all of the same elements; however, the cumbersome part 
of equation (10.51) ( A 1 A ) 1 is captured by the raised surface index of the second 
shift tensor. This is because A r A is the matrix expression for S a p and ( A r A j 1 is 
the matrix expression for 5“^. 

Exercise 218. Denote the tensor N' Nj by Tj . Show that 

7)7/ - 77. (10.52) 

The vector Y equals the sum of its normal and orthogonal projections 

V = P + T. (10.53) 

This identity is evident geometrically, but can also be shown algebraically. For an 
algebraic proof, note that the set of vectors {N, Si, S 2 } forms a basis for the three- 
dimensional linear space at any point on the surface. Since dotting equation (10.53) 
with each of these vectors produce the same result on both sides, the validity of this 
equation is confirmed. 

In component form, equation (10.53) reads 

1 = (A VNj + Z^Z“) V j . 


V 


(10.54) 


10.7 Working with the Object N' 
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Since this identity is valid for any V J , we arrive at the fundamental relationship 

N‘Nj + Z;Z“ = 8). (10.55) 

This beautiful relationship is used frequently in tensor calculus, and you will 
encounter it often in the remainder of this book. 

Exercise 219. Derive equation (10.41) from equation (10.55). 


10.7 Working with the Object N l 

We derived the key properties of the normal N and its components N' without the 
explicit formula (10.38): 


N‘ = ] -s‘ jk s a pZ c 'Z^. (10.38) 

z 

In applications, the formula (10.38) is often used to calculate the actual components 
N‘ . We note that equation (10.38) applies to two-dimensional surfaces. For other 
dimensions it needs to be modified. For a two-dimensional ambient space, the 
equation becomes 


N‘ 


e lJ e a Z“. 


For an n -dimensional ambient space, the definition of N' reads 

1 


N = 


(«-l)! 


1 yU 1 yu 

^ j. . ■ ■ ^ , 


(10.56) 


(10.57) 


The following discussion applies to the three-dimensional ambient space, but can 
be easily extended to hypersurfaces in any dimension. 

We first establish that formula (10.38) is indeed a correct expression for the 
components of the normal. In other words, we must verify equations (10.39) 
and (10.41). To verify (10.39), we write 

N*zr = ^eapZ'zf'zl (10.58) 

z 

For any system T" , the combination e'' k T" 7"'° is fully skew-symmetric on the 
indices a, ft, and y. For instance, switching a and ft, we find by the swap-reshuffle- 
rename strategy 
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Being fully-skew symmetric, e ljk 7’“ T? vanishes if any of the indices a, fi, 
or y have the same value, which is unavoidable since these three indices can 
assume only two different values. Therefore, s'^ k Z a jZ^ Z y t vanishes identically and 
equation (10.39) is confirmed. 

Equation (10.41) presents a more interesting opportunity of working with the 
Levi-Civita symbols. Let us begin by setting up the expression NjN' 

Ni N‘ = s ijk E at> ZiZ k p s irs e po Z p r Z a s . (10.60) 

First, combine the Levi-Civita symbols into S-symbols: 

NiN 1 = (10.61) 

Recall that 

Ka = S "A ~ ( 10 - 62 ) 

Thus, the right-hand side in equation (10.61) breaks up into two terms. In the first 
term, p becomes a and a becomes ft, and visa versa in the second term: 


N,N l 


1 


z/ziz-z;’ - -&izlz\zfiz 


u ijk 


ijk‘ 


(10.63) 


Next, recall that 


and 


°jk 


virs Qrs 
°ijk - s jk 

= S r jSl -8*81 


(10.64) 


(10.65) 


We therefore have four terms 


1 


NiN‘ = -ZiZ k a Z“Zt - -ZiZ k Z%Z? - -Zl ZflZ; Z" + T Z'Z£Z£Z?. 


4 


k ’ 


1 


jk * 


P^k‘ 


( 10 . 66 ) 

The key to the next step is equation (10.23). Using that identity twice in each term 
yields 


NjN 1 = -< 5 “^ - -SgSi - -8^S a o + 

1 4 “ p 4 “ p 4 “ p 4 “ p 


(10.67) 


It is left as an exercises to confirm that the sum of the four terms on the right-hand 
side is indeed 1 . 


10.7 Working with the Object N' 
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The final relationship to be verified is equation (10.55). The derivation essentially 
follows that of equation (10.41). Form the combination N' Ay 


N' N r = -e ijk £ 0lfl Z 0 ‘Z P k e rst e ot Z s o Z , t , 


and combine the Levi-Civita symbols into (5-symbols: 


N' N — -8' kk 8 az 7 a 7 & 7 s 7' 
ly r ~ ^°rst° a p Zj j 


By the identity 8°^ = S°8p — 8 z Sp, we have 


ATI it _ 1 v‘jk ya yP ys yt _ }_S'j k yu yP ys yt 
— Orst^j^k^a^p ,°rst ^ j ^ k ^ P^a’ 


For brevity, let 


r j~’i ryi ’~70L 

1 j ~ 


and recall from an earlier exercise that 

T\T,{ = T, [ 

Further, 


l j 


V 


(10.68) 


(10.69) 


(10.70) 


(10.71) 


(10.72) 


(10.73) 


Then 


N l N r 


-X k T]r k --X k T)T k . 


(10.74) 


Exercise 220. Show that the two terms in equation (10.74) are equal, thus 


N 1 Nr = ^ s k t TjT* 


(10.75) 


According to equation (9.20), we find 

N* Nr = 2 (8‘8 J J k - 8 J r 8' s 8 k + S^S* - 8 k r 8{8) + 8^ - StfSf) T]T‘, 

(10.76) 

which yields the following six terms 

TV' N r = - (s‘ r Tj T k - Si T) T k + 8 ] r T k T; - 8 k r 7'/ T[ + 8 k r T) T{ - 8\. T k TJ ) . 

(10.77) 
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With the help of equations (10.72) and (10.73), we simplify each term 

N' N r = ] - (4 Si - 2T‘ + T‘ - 2T' + T‘ - 23') . (10.78) 

We have therefore obtained 

N‘A' r = S' r - V r , (10.79) 

which is precisely the relationship that we set out to prove. 


10.8 The Christoffel Symbol T“ 

The Christoffel symbol measures the rate of change of the covariant basis with 
respect the coordinate variables. Recall the definition of the ambient Christoffel 
symbol in Chap. 5: 


3Z i 

3Z7 


T U Zk - 


(5.56) 


The analogous definition dS a /d = T^S y is not possible on an embedded 
surface. This is because the covariant basis S„ is capable of representing vectors 
only in the tangent plane. On the other hand, there is no reason to expect the vectors 
3S a /dS^ to lie in the tangent plane. If the surface curves, at least some of the four 
vectors 3S a /3 will have a component along the normal direction. 

We will therefore define the Christoffel symbol 0^, by analogy with equa- 
tion (5.60): 


The definition reads 



Py 



(5.60) 


(10.80) 


Since the central function of the Christoffel symbol is to facilitate covariant 
differentiation, our primary concern is that the Christoffel symbol transforms 
according to a certain rule and that is certainly achieved by definition (10.80). That 
rule is 


-pc/ pa ja tP tY ja ja 

1 P'y' — 1 py J a J J y' + J P'y /J a 


(10.81) 


Exercise 221. Derive equation (10.81) from the definition (10.80). 


10.9 The Length of an Embedded Curve 
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The Christoffel tensor can also be defined bypassing the covariant basis S ff and 
by referencing the covariant basis S a p 


pa _ yaa > ( ^$0)0 dSgiy 

Py 2 V 3 S y dSP 



(10.82) 


which is analogous to equation (5.66). The possibility of this definition takes on 
great significance in the discussion of Riemann spaces. 

Exercise 222. Derive equation (10.82). 

Finally, we determine the relationship between the ambient and surface Christof- 
fel symbols. Into equation (10.80) substitute the expression for the surface basis in 
terms of the ambient basis 


pa 'yurji 

1 fly ~ Z i L 


i z i z i) 

ds y 1 


(10.83) 


By the product rule 


9 Z{ 


9Z, 


r? v = z“ z' — — z ,■ + z“z J R z' — J - 

py i zcvJ 1 p qcv 


dSr 


dsy 


(10.84) 


In the first term, the dot product of the basis vectors gives S'- . In the second term, 
the partial derivative is analyzed by the chain rule. The final relationship is 


dZ‘ 

r ya P , -pz r jOL r y ] r yk 

l P y ~ z ' 957 + l jk z ‘i ^fi A f)- 


(10.85) 


This relationship offers an effective way of calculating the surface Christoffel 
symbol. 

Exercise 223. Show that when the ambient space is referred to affine coordinates, 
the relationship between the surface and the ambient Christoffel symbols simpli- 
fies to 


9 zi 

pa 'ya P_ 

P y ~ 957 • 


(10.86) 


10.9 The Length of an Embedded Curve 

Suppose that a curve that lies in the surface S is given parametrically by 
(S 1 (t) , S 2 (0). Collectively, 
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S a = S a (t). 


(10.87) 


This curve can also be considered as a curve in the ambient space where its 
parametric equations are obtained by composing the surface equations (10.3) with 
curve equations 


Z' (0 = Z' (S (0) . 


( 10 . 88 ) 


Differentiating with respect to t, we find 


dZ‘ _ . dS a 

dt “ dt 


(10.89) 


Since the length of the curve is given by 



(5.20) 


by substituting (10.89) in (5.20), we find 



(10.90) 


which, by equation (10.19), yields the beautiful, albeit expected, result 



(10.91) 


Therefore, much of what was said in Sect. 5.8 regarding reconstructing the 
ambient metric tensor from curves lengths can be carried over to surfaces. The 
surface metric tensor can likewise be reconstructed from the lengths of curves that 
lie in the surface. Therefore, the metric tensor is intrinsic, despite the fact that 
definition (10.16) is extrinsic since it relies on the covariant basis. 

Intrinsic geometry on surfaces is much more exciting than in the ambient space. 
The interest comes from the fact that different surfaces can have identical intrinsic 
geometries. For example, a section of a plane can be deformed into a cylinder 
or a cone without changing the lengths of any of the curves contained within it. 
The word deformation refers to a pointwise mapping between two surfaces and the 
coordinate systems are such that corresponding points have identical coordinates. A 
deformation that preserves the lengths of curves is called an isometric deformation 
(or transformation ) or an isometry. Two surfaces related by an isometric deforma- 
tion are said to be isometric. For example, the catenoid and the helicoid shown 
if Fig. 10.3 are isometric. Isometries are often described informally as distance- 
preserving transformations. This description captures the spirit of the definition; 


10. 10 The Impossibility of Affine Coordinates 
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Fig. 10.3 The catenoid and 
the helicoid are famous for 
being minimal surfaces 
discovered by Euler and 
Meussnier. These surfaces are 
also famous for being 
isometric and therefore 
intrinsically identical. The 
catenoid is shown with a cut 
through and along which a 
discontinuity in the isometric 
transformation may occur 



however, the term distance, when used in the context of a curved surface, is vague. 
We therefore rely on the clear notion of length of curves. 

Since the metric tensor can be reconstructed from lengths of curves, isometric 
surfaces have identical metric tensors at corresponding points. This means that all 
objects derived from the metric tensor — most notably the upcoming Christoffel 
symbol and the Riemann-Christoffel tensor — are identical as well. As in the 
ambient case, these objects are called intrinsic and it can be said that isometric 
surfaces have identical intrinsic geometries. 


10.10 The Impossibility of Affine Coordinates 

Until this chapter, we have been discussing Euclidean spaces. In Chap. 2, while we 
did not give a formal definition of a Euclidean space, we agreed that we intuitively 
understood Euclidean spaces as being straight. A related property of a Euclidean 
space is the possibility of referring it to an affine coordinate system. An affine 
coordinate system is characterized by a constant metric tensor and a vanishing 
Christoffel symbol. 

On most curved surfaces, it is not possible to introduce an affine coordinate 
system (i.e., a coordinate system characterized by a vanishing Christoffel symbol), 
not even on a small patch. In other words, a curved surface is generally not a 
Euclidean field. So far, we have not developed the necessary analytical machinery 
to demonstrate this idea. This machinery will be developed over the next two 
chapters and the impossibility of affine coordinates will be convincingly presented 
in Chap. 12 on curvature, where a nonvanishing Gaussian curvature will provide 
irrefutable analytical evidence of this impossibility. 

There are several types of surfaces that are exceptions to this rule, namely, 
surfaces isometric to a plane. Two such surfaces are the cylinder and the cone. What 
these two surfaces have in common is that we can form them from a flat sheet of 
paper without sheering or stretching. In other words, we can form them by isometric 
deformations, that is. deformations that do not change distances between points. The 
discussion of these ideas is the highlight of Chap. 12 and possibly of this entire book. 
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10.11 Examples of Surfaces 

In all examples, the ambient space is referred to Cartesian coordinates x, y,z or 

x\,y\,z\. 


10.11.1 A Sphere of Radius R 


Consider a sphere of radius R (Fig. 10.4) given by 


x (9. <fi) = R sin 9 cos 0 
y (9,(p) = R sin 9 sin (f> 
z {9. <p) = R cos 9. 


Shift tensor 


ZL = 


R cos 9 cos (p —R sin 9 sin 0 
R cos 9 sin 0 R sin 9 cos 0 
— Rsin9 0 


Exterior normal 


N' 


sin 9 cos 0 
sin 9 sin 0 
cos 9 


(10.92) 

(10.93) 

(10.94) 


(10.95) 


(10.96) 


The normal N l can be obtained as the cross product of the “columns” of Z' a . 
However, in this simple case, we simply guessed the values of N' from geometric 
considerations. 

Metric tensors 


Safi — 


R 2 0 
0 R 2 sin 2 9 


S aP = 


R~ 2 0 

0 R~ 2 sin -2 9 


Area element 


VS = R 2 sin 9. 


(10.97) 


(10.98) 


Christoffel symbol. Letting 0=1 and T> = 2, the nonzero entries are 


rfs = — sin 9 cos 9 


(10.99) 


10. 1 1 Examples of Surfaces 
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Fig. 10.4 A sphere of radius 
R referred to a spherical 
coordinate system 



p‘f ) 

1 ©$ — 1 $0 


cot 9. 


( 10 . 100 ) 


10.11.2 A Cylinder of Radius R 

An infinite cylinder of radius R (Fig. 10.5) is given by 

( 10 . 101 ) 
( 10 . 102 ) 
(10.103) 

Shift tensor and exterior normal 


x\ ( 9 , z) = R cos 9 
yi (9,z) = RsmO 
zi ( 9 , z) = z. 



—R sin 9 

O' 


cos 9 

z* — 

R cos 6 

0 

; N‘ = 

sin 9 


0 

1 


0 


Metric tensors 


SotB 


R 2 0 
0 1 


S afl 


R~ 2 0 

0 1 


(10.104) 


(10.105) 


Area element 


VS = R (10.106) 

The Christoffel symbol vanishes indicated that the chosen coordinate system is 
affine. 
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Fig. 10.5 Infinite cylinder of 
radius R referred to 
coordinates Z 1 = 9, Z 2 = t, 



10.11.3 A Torus with Radii R and r 


Suppose that the torus (Fig. 10.6) is given by the equations 


x (9, <p) = (R + r cos <p) cos 9 
y ( 9 , <p) = (R + r cos <p) sin 9 
z{9.(p) = R sin (j), 


where 9 = S 1 and cp = S 2 . 


Shift tensor and exterior normal 


(10.107) 

(10.108) 
(10.109) 



— (R + r cos cp) sin0 

— r cos 9 sin cp 


cos 9 cos (p 

r 7 i - 

^ OL ~ 

(R + r cos cp) cos 9 

—r sin 9 sin cp 

; N‘ = 

sin 9 cos (p 


0 

r cos (p _ 


sin (p _ 


Metric tensors 


Raft 


( R + r cos </>) 0 
0 r 2 


; S ap = 


(R + r cos cp) 1 0 


0 


„-2 


( 10 . 111 ) 


Area element 


VS = r(R + r cos <p ) 


( 10 . 112 ) 


10. 1 1 Examples of Surfaces 
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Fig. 10.6 Torus 
characterized by radii R and 
/•. The surface is referred to 
coordinates S 1 = 9 and 
S 2 = cj> 



Christoffel symbol. Letting 0=1 and <t> = 2, the nonzero entries are 


R + r cos cj) 

1 0© - r 

(10.113) 

T.© T,e ? ‘ s in( t> 

l0<6_1 ® 0_ R + r cos (p 

(10.114) 

10.11.4 A Surface of Revolution 


Consider a surface of revolution in Cartesian coordinates x\ , Vi , zt 

is given by 

x\ ( 9 , z) = r (z) cos 9 

(10.115) 

yi (9, z) = r (z) sin 9 

(10.116) 

zt (9, z) = z. 

(10.117) 


Note that the cylinder (10. 101)— (10. 103) is a surface of revolution with r (z) = R. 
Shift tensor and exterior normal 


^ a ~ 

Metric tensors 

S/yft 


-r ( z ) sin 6 r' (z) cos 6 
r (z) cos 9 r' (z) sin 9 

0 1 


r cos 0 n 

Vi+'-'w 2 


N‘ = 


Vi+'-'b) 2 

r'(z) 

l yr+pe j 


(10.118) 


1 

< 

0 

CN 

5*». 

1 

• S a P - 

r (z) 2 0 

L 0 l + r' (z) J 


0 l+r'D 2 . 


(10.119) 
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Area element 

Vs = r (z) y/l + r' (z) 2 . (10.120) 

Christoffel symbols. Letting 0=1 and Z = 2, the nonzero entries are 


p© _ 

1 ez — 

r © r'(z) 

Z0 r (z) 

(10.121) 

r z — 

r (z) r' (z) 

(10.122) 


1 + r' (z) 2 

r z — 

r’ (z) r" (z) 

(10.123) 

1 ZZ — 

1 + r’ (z) 2 


Exercise 224. Derive the objects given in this section. 

Exercise 225. Rederive the objects on the sphere by referring the ambient space to 
spherical coordinates r \ , 9i , <f>\ in which the sphere is given by 

n(e,<p) = R (10.124) 

e l {6,<j)) = Q (10.125) 

<p\ (9, <p) = <j). (10.126) 


10.11.5 A Planar Curve in Cartesian Coordinates 


A curve embedded in a two-dimensional plane can be viewed as a hypersurface. 
Suppose that the plane is referred to Cartesian coordinates x, y and let the curve be 
referred to 5 1 = t . 

Shift tensor and normal 





r y'd) 1 

- 

i 

; V = 

y/x’(t) 2 +y'(t) 2 

x'(t) 



L Jx’(t) 2 +y'(t) 2 J 


(10.127) 


Metric tensors 


x' (o 2 + y o 0 2 


Sap = x 1 0 0 2 + S ( t ) 2 ; S' 


(10.128) 



10. 12 A Planar Curve in Polar Coordinates 
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Volume (or length) element 


= yjx' (tf + y' (tf 


(10.129) 


The Christoffel symbol has a single entry T with the value 

f _ x'(t)x"(t) + y'(t)y"(t) 
x' ( t f + y' (tf 


(10.130) 


Exercise 226. Show that when the curve is referred to the arc length s, the above 
expression simplifies to the following 


N ' , = 


y' 0 ) 
x’(s)] 


S afi = 1; S aP = U Vs = 1; f = 0 


(10.131) 


Note that f simplifies to x' ( s ) x" (s) + y' ( s ) y" (s) and you need to show that this 
expression vanishes identically. 

Exercise 227. Show that when the curve is given as a graph y = y (x), the same 
differential objects are given by 


Z' = 


1 

g' (*) 


N' = 


s afi = i + /W 2 ; s aP 


S = Jl + y' (x) 2 ; T 


" j'(a-) " 

Vi+yw 2 
1 

_ Vt+/w 2 _ 

l 

" i + y' ixf 

= y' (x)y"(x) 
1 + y' (xf 


(10.132) 

(10.133) 

(10.134) 


10.12 A Planar Curve in Polar Coordinates 

Suppose that the plane is referred to Cartesian coordinates ( r , 9) and let the curve 
be referred to S 1 = t . Let the curve be given by (r ( t ) , 6 (f )). 
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Shift tensor and normal 


r(t)8'(t) 



VW) 2 +rO)V(r) 2 

r(tr l r'(t) 


V r'{t) 2 +r(t) 2 8’(t ) 2 _ 


(10.135) 


Metric tensors 


S afi = r'(tf + r(t) 2 0' (t) 2 ; 



Area element 


VS = y/ r’ (tf + r (tf 9’ ( t f 


(10.137) 


The Christoffel symbol T? has a single entry with the value 


r (t) r' (0 0' (t) 2 + r' (t) r" (t) + r {tf 9’ (t) 9" (f) 
r' (tf + r (tf 9' {tf 


(10.138) 


Exercise 228. Show that when the curve is referred to the arc length s, the above 
expression simplify to the following 


r’(s) 1. Ni = \ r(s)9’(s) 


(10.139) 


“ ~ L^'CrU ’ ~ L-r (s)- 1 r 1 (s) 

Safi = 1; S aft = 1 
VS = 1; f = 0 


(10.140) 


(10.141) 


Again, note that f simplifies to r (s) r' (s) 9' (s) 2 + r' ( s ) r" (^) + r (sf 9’ (s) 9" ( s ) 
and it remains to show that this expression vanishes identically. 


10.13 Summary 

This chapter, devoted to the fundamental differential objects on surfaces, parallels 
Chap. 5 concerned with Euclidean spaces. Curved surfaces add new richness to 
tensor calculus, which proves time and again that the tensor framework is ideally 
suited for numerous aspects of differential geometry. This chapter was focused on 
the tangent and normal spaces as well as on the surface metrics. We discovered a 
number of important relationships involving the shift tensor and the normal. The 
most vivid relationships were captured by equations (10.23) and (10.55). 


Chapter 11 

The Covariant Surface Derivative 


11.1 Preview 

In the preceding chapter, we gave an overview of embedded surfaces. We now 
turn to the important question of covariant differentiation on the surface. We 
will divide the construction of the covariant derivative into two parts. We will 
first define this operator for objects with surface indices. The definition will be 
completely analogous to that of the covariant derivative in the ambient space. 
While the definition will be identical, some of the important characteristics of the 
surface covariant derivative will be quite different. In particular, surface covariant 
derivatives do not commute. Our proof of commutativity for the ambient derivative 
was based on the existence of affine coordinates in Euclidean spaces. Since affine 
coordinates may not be possible on a curved surface, that argument is no longer 
available. We will also discover that the surface covariant derivative is not metrinilic 
with respect to the covariant basis Sq.. This will prove fundamental and will give rise 
to the curvature tensor, which will be further developed in Chap. 12. 

Having defined the covariant derivatives for objects with surface indices, we will 
extend the definition to surface objects with ambient indices, such as the shift tensor 
Z' a and normal N ' . We will discover a chain rule that applies to surface restrictions 
of objects defined in the ambient space. The chain rule will show that the surface 
covariant derivative is metrinilic with respect to the ambient metrics. Finally, we 
will derive formulas for differentiating the shift tensor and the normal. 


11.2 The Covariant Derivative for Objects 
with Surface Indices 

We begin our discussion by constructing the covariant derivative as applied to 
objects with surface components only. These objects include the covariant and the 
contravariant bases S a and S a , metric tensors S a p and 5“^, and Levi-Civita symbols 
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s u p and s a P . For the time being, the covariant derivative will remain undefined for 
objects with ambient indices, such as the shift tensor Z' a and the normal N ' . 

Having introduced the Christoffel symbol T^ y , we can define the covariant 
derivative V Y for a variant 7J (with a representative collection of surface indices) 
by analogy with the ambient case: 

97“ 

v v r„“ = — * - + r“ - r%7;“ (li.i) 

Y p 9 Cy Y® P y p CO \ / 

On a curved surface, the Christoffel symbol is defined by equation (10.80) 


' dsy 


by analogy with the ambient equation (5.60) 


r*. = z k 


dZj 

dZJ 


since the analogy with the ambient equation (5.56) 

dz i . 

dzJ ~ TijZk 


( 10 . 80 ) 


(5.60) 


(5.56) 


may not be possible: the vector dSp / dS Y may very well lie out of the tangent plane 
due to curvature. This may be seen as the source of all analytical differences between 
the surface and the ambient covariant derivatives. 

The two main differences are (1) surface covariant derivatives do not commute, 
and (2) the metrinilic property of the surface covariant derivative does not hold 
for the surface covariant basis. The cause for both of these facts will be seen to 
be curvature. The replacement identities offer tremendous analytical and geometric 
insight into the properties of curvature. The discussion of these ideas is saved for 
Chap. 12, an entire chapter devoted to curvature. In this chapter, curvature and the 
modified commutativity equation are given but not discussed. 


11.3 Properties of the Surface Covariant Derivative 

In this section, we give an enumeration of the surviving properties of the covariant 
derivative. These properties can be demonstrated in the same way as those for the 
ambient covariant derivative. 

1. The covariant derivative produces tensors out of tensors. 

2. The covariant derivative V Y coincides with the partial derivative d/8S y when 
applied to variants of order zero. 


1 1 .4 The Surface Divergence and Laplacian 
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3. The covariant derivative satisfies the sum and product rules. 

4. The metrinilic property of the covariant derivative applies to all metrics, except 
for S„ and S“ : 


V y S a/ s, V y S“^ = 0 

(11.2) 

V y e ai 8, V Y e“^ = 0 

(11.3) 

Vy$, V y 9£ = 0 

(11.4) 


5. The covariant derivative commutes with contraction. 

Exercise 229. Show the tensor property of the surface covariant derivative. 

Exercise 230. Show the sum and product rules for the surface covariant derivative. 

Exercise 231. Show the metrinilic property of the surface covariant derivative. 

Exercise 232. Show commutativity with contraction of the surface covariant 
derivative. 


11.4 The Surface Divergence and Laplacian 

The covariant derivative gives immediate rise to the surface Laplacian V a V“. This 
operator is often referred to as the Laplace-Beltrami operator or as the Beltrami 
operator. It is particularly interesting that its geometric interpretation is not easy to 
see. On the other hand, its analytical definition provided by the tensor framework is 
entirely straightforward. 

The Voss-Weyl formula continues to apply. The surface divergence V a T a is 
given by 


V„T“ 


1 9 



The Laplacian of an invariant field F is calculated by the formula 


V V“F 


1 9 



(11.5) 


(11.6) 


Exercise 233. Show that surface Laplacian on the surface of a sphere (10.92)- 
(10.94) is given by 


WaV^F = 


1 9 / 9F \ 


1 d 2 F 


R 2 sin 2 0 d<p 2 ' 


(11.7) 


188 


1 1 The Covariant Surface Derivative 


Exercise 234. Show that surface Laplacian on the surface of a cylinder (10.101)- 
(10.103) is given by 


V a V“F 


1 d 2 F | d 2 F 

iF-Je 2 + ~ 3 ?“' 


(11.8) 


Exercise 235. Show that surface Laplacian on the surface of a torus (10.107)- 
(10.109) is given by 


V„V“ F 


1 a2p 1 3 / „ n 3F) 

(R + r cos (p) 2 dO 2 + r 2 (R + r cos <p) dcp \ + r cos 3 0 / 

(11.9) 


Exercise 236. Show that surface Laplacian on the surface of revolution (10.115)- 
(10.117) is given by 


V„V“.F = 


' (z) 7 1 + r f (z ) 2 d ~ l y 1 + r' (z) 


r ( z ) 


3 F 


dz 


+ 


1 3 2 F 

7W M 2 


( 11 . 10 ) 


11.5 The Curvature Tensor 

The curvature tensor arises out of the failure of the metrinilic property with respect 
to the covariant basis. Expand V„S/j according to definition (11.1): 

3S« 

v^= g^-r^Sy. (i1.1i) 

Note that V„S/j is symmetric 

V„S /! = V / ,S 0f (11.12) 

since the first term in (11.11) equals <i 2 R/<)S a dS^ and the second term is propor- 
tional to the Christoffel symbol V 2 ,, which is symmetric in its lower indices. 

Unlike the quantity V, Z y , which vanished by definition (5.56), the quantity 
does not vanish. After all, a direct analogue of equation (5.56) does not exist due to 
the curvature of the surface. Nevertheless, a geometric observation of great value can 
be made of V„ S p : each of these four vectors is orthogonal to the tangent plane. To 
put it another way, while the vector V^S^g does not vanish, its tangential components 
do. To show this, dot the identity (11.11) with S s : 


s' ■ v„s t = s* ■ - r'jS, ■ s’. 


(11.13) 
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Since S Y • S' 5 = S', we find 


S 4 • V a S/, 



and the right-hand side vanishes by the definition (10.80). Thus, 


S' • VaSp = 0 


(11.14) 


(11.15) 


and the orthogonality of V„S/j to the tangent plane is established. 

This geometric observation leads to the definition of the curvature tensor. Since 
V„S/j is normal to the plane, it must be a multiple of the normal vector N. Let that 
multiple be B u p : 


VaS^ = NB a p. (11.16) 

The variant B a p is called the curvature tensor. Dotting both sides of this identity 
with N yields an explicit expression for B a p 

B a p = N • V a Sp (11.17) 

and confirms its tensor property. Note that the sign of B a p depends on the choice of 
the normal. 

The curvature tensor plays a central role in the analysis of embedded surfaces. 
So important is its role that we have devoted an entire chapter to its study. Having 
shaken hands with it in this chapter, we part with the curvature tensor until our 
in-depth investigation in Chap. 12. 


11.6 Loss of Commutativity 

Recall the argument by which we proved that ambient covariant derivatives 
commute. We evaluated the commutator 


( V; Vy Vy V/ ) 7^ (11.18) 

in affine coordinates where the covariant derivatives coincide with partial derivatives 
and therefore commute. As a result, the commutator vanishes in affine coordinates 
and, being a tensor, in all coordinates. 

This argument is not available on curved surfaces since there may not exist an 
affine coordinate system. As a result, commutativity is lost. Instead, the commutator 
is governed by the Riemann-Christoffel tensor R y Sol g, introduced in the ambient 
space in Chap. 8. It is straightforward to show that for a contravariant tensor T ‘ . the 
commutator takes the following form 
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where 


(V^-V^) TV = R? Sap T s 


(11-19) 


R 


Safi 


35" 


3 S? 


+ r Y 

1 m 


pul 

otco ^ f$8 


pY p&) 

j Bco^ ot8’ 


( 11 . 20 ) 


Example 237. Derive equations (1 1.19) and (11.20). 

An in-depth discussion of the Riemann-Christoffel tensor takes place in 
Chap. 12. 


11.7 The Covariant Derivative for Objects 
with Ambient Indices 

11.7.1 Motivation 

The clue to the covariant derivative for objects with ambient indices comes 
from differentiating an invariant field T (such as the surface normal N) and its 
components T l . We are looking for an operator that satisfies the product rule and 
coincides with the partial derivative when applied to variants of order zero. For an 
operator V Y that satisfies these properties, the following chain of identities would be 
valid: 


3T dirzA dr . 3z,- 
V v T = — = ’ = — Z i + T —T 


dS y 3 S- 

Apply the chain rule to the second term 


3S V 


ds v 


dr 

VyT = — Z t + T 


... 3Z; dZJ 


V dS y 3 zi dSv 


and recognize that 9Z//3Z 7 = and 3Z y /dS y = Zy : 


dT l 

v y T = -z, + rz^r‘z t . 


35. 


( 11 - 21 ) 


( 11 . 22 ) 


(11.23) 


The right-hand side can be rewritten as 


11.7 The Covariant Derivative for Objects with Ambient Indices 


191 


Finally, if we demand that V y is metrinilic with respect to Z ,• , the left-hand side can 
be transformed by the product rule 

Vyr Z ' = {w y + Z v kr *» rm ) Z (1L25) 

This identity suggest the definition 

dT l 

v Y r = + zk y n,n Tm (11-26) 

and indeed this good guess leads to a definition that has all of the desired properties. 


11.7.2 The Covariant Surface Derivative in Full Generality 


Motivated by equation (11.26), we give the following definition for the covariant 
surface derivative applied to a variant Tj with ambient indices: 


dT i » 

V '-ri J . r y k pi r rtn 7/cpffl p<i 

y 1 j — T77; I 1 lrrr, 1 i ~ 1 lei 1 m • 


as. 


V km 1 j 


"Y 1 kj 1 m • 


(11.27) 


To give the ultimate definition, consider a variant T'f with a fully representative 
collection of indices. That definition reads 


V T lct — 
1 - 


3T'“ , . , 

JP 1 ^/c-pz r T i moi 'ykpm '-pi a. . pa pico ptw pza 

g£ ' ^Y ^knPjfi kj 1 mfi 1 Tco 1 jfi i Y P l jW 


(11.28) 


We leave it as an exercise to prove each of the following fundamental properties. 

1. The covariant derivative produces tensors out of tensors. 

2. The covariant derivative V Y coincides with the partial derivative d/dS y when 
applied to invariants (or any variants of order zero). 

3. The covariant derivative satisfies the sum and product rules. 

4. The metrinilic property of the covariant derivative applies to all metrics, except 
for S„ and S“ : 


v y s a p, v y s afi = 0 

(11.29) 

V y e a p, V y e“^ = 0 

(11.30) 

V y $, V y 8p = 0 

(11.31) 


5. The covariant derivative commutes with contraction. 
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11.8 The Chain Rule 


Consider a variant field Tj defined in the ambient space. We may also consider the 
restriction of Tj to the surface S and apply the covariant derivative to this restriction 
by definition (11.27) 


V T' 

\yl j 


dTj 

dS, 


I ^/CT-lZ rpm 

^1 1 km 1 j 


yk pff) rjii 

^Y 1 kj 1 rrr 


(11.32) 


Being a restriction of an ambient field, the function Tj ( S ) can be represented by 
the composition Tj (S) = Tj (Z ( S )). Therefore, the partial derivative term can be 
transformed by the chain rule 


V T‘ 
v y 1 j 


dTj dz k 

9z* asy 


+ Zy r, 


ym 
km 1 j 


yk -pm yi 
Zj y l kj 1 m 


(11.33) 


The object dZ"'/dS v is recognized as the shift tensor Z”\ which can now be 
factored out from each term 


V T' — 

\ylj - 


+ n 


Y' m T l 

km 1 j 1 kj 1 m 


(11.34) 


The quantity in parentheses is recognized as V). Tj and we conclude that 


VyTj = Z k S7 k Tj. 


(11.35) 


This elegant relationship is known as the chain rule. It states that the covariant 
surface derivative of the restriction of an ambient field is the projection of the 
ambient covariant derivative. 

The chain rule is ubiquitous in tensor calculations and has the following 
immediate implication: the covariant surface derivative is metrinilic with respect 
to all ambient metrics: 


V Y Z,, V Y Z' 

= 0 

(11.36) 

V Y Zy, V v z" 

= 0 

(11.37) 

Vy*, V y e iJk 

= 0 

(11.38) 

V'-J, v Y ^ 

= 0 . 

(11.39) 


One of the important implications of the metrinilic property (1 1.37) is that ambient 
indices can be juggled “across” the covariant derivative. For example, S, V Y T‘ = 
S'WyTj. and Sj Y = V v Tj implies S' y = V Y T‘ . 

Exercise 238. Justify equations (1 1 .36)— ( 1 1.39). 


1 1.9 The Formulas for V^Zt and V a N' 
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11.9 The Formulas for V„Z^ and V a N l 

There are two outstanding elements on the surface, the shift tensor and the 
normal N' , for which we have not yet established differentiation rules. The 
metrinilic property (11.36) of the covariant derivative yields the rule for the shift 
tensor. Recall the definition (1 1.16) of the curvature tensor: 

V a Sp=SB a p. (11.16) 

By the product rule and the metrinilic property, we find: 

V«S„ = V„ (z'Z,) = V a Z^ Z,. (11.40) 

Decompose the normal N on the right-hand side of equation (11.16) with respect 
to Z, 


V„Z‘ Z, = N ' B a p Z,- . (11.41) 

Equating the vector components gives the desired formula 

VaZ^j = N’ B a p. (11.42) 

Contracting both sides of this equation with /V, gives an explicit expression for the 
curvature tensor: 

B a p = NiV a Zp. (11.43) 

We now turn to the normal N l . One way to derive the expression for V„ N ' is by 
appealing to the explicit expression (10.38) for N' . Alternatively, we can appeal to 
the two equations (10.39) and (10.41) that define N' implicitly. We pursue the latter 
approach first. 

Applying the covariant derivative to equation (10.41) 

NjN' = 1, (10.41) 

we find 

NjV a N‘ =0. (11.44) 

Next, juggle the index i in equation (10.39) 


N l Z ia = 0 


(11.45) 
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and apply the covariant derivative. By the product rule, 


WpN’ Z ia + N l V p Z ia = 0. 


(11.46) 


By equation (1 1.43), the second term is the curvature tensor B a p. Thus, 


VpN' Z ia = —B a p. 


(11.47) 


To isolate V pN l , contract both sides with Z' y/ 


VpN 1 Zj a Z aj = —Z aj B a p. 


(11.48) 


By the projection equation (10.55), the left-hand side is 


VpN J - NjN J VpN' = — Z" B a p 


(11.49) 


Since the second term on the left-hand side vanishes by equation (1 1.44), we arrive 
at the ultimate expression for V a N 1 



(11.50) 


Equation (1 1.50) is also known as Weingarten ’s formula. 

Exercise 239. Derive equation (1 1.44) from (10.41). 

We now derive Weingarten’s formula (11.50) from the explicit expression (10.38) 
for the normal. We leave as an exercise to derive the following identity 



(11.51) 


The geometric interpretation of this identity is straightforward: the cross-product- 
like combination of the normal and a covariant basis element yields a tensor 
orthogonal to both, that is, a corresponding contravariant basis element. This 
fundamental relationship is, surprisingly, not needed often, but finds a direct 
application in the following derivation. 

Exercise 240. Confirm equation (11.51) by substituting an explicit expression 
for Nk- 

An application of the covariant derivative to equation (10.38) 



(10.38) 
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yields 

V y JV f = l -e ijk e a p (v y Z“ zf + Z“ V y zf ) . (1 1.52) 

The terms in parentheses yield curvature tensors by equation (11.42): 

V y N‘ = l -s ijk e afi ( Nj B“ZI + Z)N k B f ) . (1 1.53) 

Multiply out the right-hand side 

V y JV'' = y^eapNjB^Z^ + ^ e ijk e a p Z" N k Bff . (11.54) 

The two terms are actually the same, which can be seen after a few index-renaming 
steps. Therefore, 

V y iV'' = e^SapNj B“Z* (1 1.55) 

and Weingarten’s formula (1 1.50) follows by an application of equation (1 1.51). 


11.10 The Normal Derivative 

Consider an invariant function u defined in the ambient space. At the points on an 
embedded surface S , we may be interested in the directional derivative of u along the 
normal N k . The directional derivative in this special direction is called the normal 
derivative and is often denoted by du/dn. According to equation (2.1 1), the normal 
derivative is given by 


— = )V'V,h. (11.56) 

an 

The normal derivative is a ubiquitous concept in applied mathematics, physics, and 
engineering. 

According to equation (1 1.56), the normal derivative du/dn can be thought of as 
the normal component of the ambient gradient V, u. Suppose that normal derivative 
du/dn and the covariant surface derivative V„ u are known at a particular point on S. 
Then the full ambient derivative V, u can be reconstructed by the formula 


v,-h = zy aU + Ni — • 


(11.57) 
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To demonstrate equation (11.57), rename the index i in definition (11.56) and 
multiply both sides by IV; , 


N i — = N i N J S7. u. (11.58) 

3 n 

By the projection rule (10.55), 

(1L59) 

Expanding, we find by the chain rule 

3 u 

Nj — = V; u - (1 1.60) 

3 n 

which is equivalent to equation (1 1.57). 

A number of beautiful relationships exist that relate the second-order ambient 
and surface covariant derivatives of an ambient variant. Perhaps the most commonly 
encountered one is 


N' N j V; Wj U = V;V' m- V a V“l<+ (11.61) 

The derivation of this equation is left as an exercise. Equation (1 1.61) has a number 
of valuable interpretations. One of them is the relationship of the surface Laplacian 
V„ V"t; to the ambient Laplacian V, V'm. This perspective can be brought to the 
forefront by writing equation (1 1.61) in the form 

V„ = V;V'm- N i N j V i V j u+ (11.62) 


Exercise 241. Derive equation (11.61). Hint: Transform the surface Laplacian 
V„ V"m by the chain rule (11.35) starting with the innermost covariant derivative 
V“. 

Exercise 242. Show that 


3 n 2 


rfNjVtVj. 


(11.63) 


You may find it helpful to refer the ambient space to an affine coordinate system. 


11.11 Summary 
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11.11 Summary 

In this chapter, we constructed the covariant derivative on embedded surfaces. When 
applied to objects with surface indices, the definition of the covariant derivative 
was analogous to that in the ambient Euclidean space. However, since a complete 
analogy in the definition of the Christoffel symbol is not possible, two properties — 
the metrinilic property and commutativity — had to be modified. Both alterations are 
associated with curvature. The loss of the metrinilic property with respect to the 
covariant basis led to the introduction of the curvature basis. Loss of commutativity 
led to the nontrivial Riemann-Christoffel tensor and an alternative introduction of 
Gaussian curvature. 



Chapter 12 

Curvature 


12.1 Preview 

This chapter is devoted to the study of curvature. This is an exciting and beautiful 
topic. The highlight of this chapter, and perhaps the entire book is Gauss’s Theorema 
Egregium, or the Remarkable Theorem. 

The first half of this chapter is devoted to the intrinsic analysis of surfaces. 
Intrinsic analysis deals with quantities derived from the metric tensor and its 
partial derivatives. The central object in the intrinsic perspective in the Riemann- 
Christoffel tensor R a p y s that arises in the analysis of the commutator V a Vp — 
V^Vq,. The second half of the chapter focuses on the curvature tensor B a p, an 
extrinsic object because it depends on the way in which the surface is embedded 
in the ambient space. Theorema Egregium relates the Riemann-Christoffel and the 
curvature tensor and thus links the intrinsic and the extrinsic perspectives. 

It is exciting to have the opportunity to come face to face with Carl Friedrich 
Gauss (Fig. 12.1) and his protege Bernhard Riemann, two of the greatest mathe- 
maticians in history. These amazing scientists spent their entire lives pondering the 
concept of space and curvature and attained a depth of understanding unmatched to 
this day. The equations contained in this chapter flow effortlessly from one to the 
next and their beauty is a tribute to the remarkable achievements of those great men 
and their predecessors. 


12.2 The Riemann-Christoffel Tensor 

We begin with a discussion of the intrinsic elements of the surface. In the previous 
chapter, we undertook the critical question of whether covariant surface derivatives 
commute. In this chapter, we discover that the general answer is no. 


P. Grinfeld, Introduction to Tensor Analysis and the Calculus of Moving Surfaces , 

DOI 10.1007/978-1-4614-7867-6-12, © Springer Science+Business Media New York 2013 
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Fig. 12.1 Carl Friedrich 
Gauss (1777-1855) was a 
pivotal figure in differential 
geometry and mathematics in 
general 



The commutator V a V a applied to a variant T y with a surface superscript 

is governed by the rule 


( v <* v /i - V^V„) T y = R y Safi T s , 
where the Riemann-Christoffel tensor R y Sct a is given by 


(11-19) 


R 


3rL ar 7 , 

pb aS | yiy -pet) py pcti 

•Safi 3S a dS@ ' ^ aQ) ^ fico^ aS' 


( 11 . 20 ) 


Introduce the Riemann-Christoffel tensor R y s a p with the lowered first index 

RySotfi = SycoR^Safi- (12.1) 


Exercise 243. Show that 


RySotfi — 


3r y t p$ 

ds a 


3r y , a « 

dSP 


+ r®.y^r, 


aS 


v r® 

A a),ya*- fi8 • 


(12.2) 


In the ambient Euclidean space, the analogous expressions are found in equations 
(8.131) and (8.130). However, due to the Euclidean nature of the ambient space, the 
ambient Riemann-Christoffel tensor vanishes and therefore covariant derivatives 
commute. The vanishing of the Riemann-Christoffel tensor is shown in two ways. 
The first approach relies on the evaluation of the Riemann-Christoffel tensor in 
Cartesian coordinates. The second approach, outlined in Exercise 146, relies on 
substituting an explicit expression for the Christoffel symbol into the definition of 
the Riemann-Christoffel tensor. Neither approach carries over to curved surfaces: 
the surface may not admit a Cartesian coordinate system and the partial derivative 
of the contravariant basis may have a normal component and therefore may not be 
re-expressed in the contravariant basis. 
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From the definition (11.19) and equation ( 1 1 .20), it can be easily seen that R y s a p 
is antisymmetric in a and 


Ry&fta — RySafii 


(12.3) 


as well as y and <5 


R&yaP = -RySafi- (12.4) 

A little bit more work is required to show the R a p y s is symmetric with respect to 
switching the pairs of indices aft and y8 : 


RySccfi — RafiyS' 


(12.5) 


Exercise 244. Show that 


R'. 


ayS 


o. 


(12.6) 


Exercise 245. Show relationships (12.3)— (12.5). 

Exercise 246. Show that equation (12.4) follows from equations (12.3) and (12.5). 
Exercise 247. The Ricci tensor R u p is defined as 


R*P = Km- ( 12 . 7 ) 

Its trace 

R = R a a ( 12 . 8 ) 


is called the scalar curvature. Show that the Ricci tensor is symmetric. 

Exercise 248. The Einstein tensor G a p is defined by 

G a p = R a p — - RS a p . (12.9) 

Show that the Einstein tensor is symmetric. 

Exercise 249. Show that, for two-dimensional manifolds, the trace G“ of the 
Einstein tensor G a p vanishes. 

Exercise 250. Show that for a covariant tensor T y , the commutator relationship 
reads 


(V„v^ - VpV a ) T y = -R s yafl T s . 


(12.10) 
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Exercise 251. Show that covariant derivatives commute when applied to invariants 


and variants with ambient indices 

(v a v /} -v /J v a )r = o (12.11) 

(V c Vp- VyjVa) r = 0 (12.12) 

(V^- =0. (12.13) 

Exercise 252. Show the first Bianchi identity 

RaPyS "b RaySp T RaSPy = 0- (12.14) 

Exercise 253. Show the second Bianchi identity 

V E R a p y 8 + VyRupSe + VsRciPey = 0. (12.15) 


Equations (12.12) and (12.13) can be shown rather easily by a direct application 
of the definition of the covariant derivative. However, there is a more elegant way of 
showing this. For a variant T' , form the vector T = T' Z, . By equation (12.1 1), 

V„V /i T-V /i V„T = 0. (12.16) 

Since the surface covariant derivative is metrinilic with respect the ambient basis 
Z,- , we have 


(v a v^r - v^v a r)z,- = o (12.17) 

and equation (12.12) follows. 

So far we have analyzed the commutator V a V„ for variants of order one. 

Applying the commutator to variants of order greater than one results in a sum of 
appropriate Riemann-Christoffel terms for each surface index. For example, for a 
tensor Tjg with a representative collection of indices, the commutator is given by 

(Wp - V^) T% = Ri ap T# - R% a p T*y . (12.18) 

Equation (12.18) shows that, from the index manipulation point of view, the role of 
the Riemann-Christoffel tensor in the commutator is similar to that of the Christoffel 
symbol in the covariant derivative: superscripts are contracted with the second index 
of +R% and subscripts are contracted with the first index of —R s Ya p- 

To show equation (12.18) for a variant Tl (dropping the less interesting ambient 
indices), define 


77 S d 


(12.19) 
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where is an arbitrary variant. Since 

V„v^:r = v a v p T y s s + v p T*v a s s + v a T y v p s s + T y v a v p s s (12.20) 

and the sum of the middle two terms on the right-hand side is symmetric in a and 
/3, we find 

V a Vf)T Y - VpVaT? = (v a v^r/ - v p v a T y ) S s + T y (V a V„S s - V p V a S s ) , 

( 12 . 21 ) 

or 

(v a v p T s y - v p v a r y ) s * = rj^T” - r L p s“T y . (12.22) 

Switching the names of the indices y and o> in the last term, we find 

(V a V p T s y - V p V a T s y ) S s = r^TfS* - r^T y S s . (12.23) 

Since S'" is arbitrary, equation (12.18) is confirmed. 


12.3 The Gaussian Curvature 

For a two-dimensional surface, equations (12.3)— (12.5) show that the Riemann- 
Christoffel symbol has a single degree of freedom R 1212 and the four nonzero entries 
are ± /s’ 1 7 1 2 • Thus, the Riemann-Christoffel tensor can be captured by the formula 

RySaP = R\2ne y 8 e ciP- (12.24) 


Switching from permutation systems to Levi-Civita symbols, we find 

RySctP = ^ SySSa, 3- (12.25) 

The invariant quantity 

Rl21? 

K = pp (12.26) 

is known as the Gaussian curvature K. It is one of the most fundamental and 
interesting characteristics of a surface. In terms of the Gaussian curvature K, the 
Riemann-Christoffel tensor is given by 


RySafi — KSy$B aP . 


(12.27) 
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Exercise 254. Derive the following explicit expression for K : 

K = ^s^RySap. (12.28) 

In particular, equation (12.28) shows that K is a tensor. 

Exercise 255. Show that 


RySafi = K ( S a ySp8 ~ S a sSp y ) . (12.29) 

Exercise 256. Show that the Gaussian curvature K is also given by 

K = Vjj . (12.30) 

Equivalently, 

K= l -R a a , (12.31) 

where R a p is the Ricci tensor. In other words, for a two-dimensional surface, the 

Gaussian curvature is half of the scalar curvature. 


12.4 The Curvature Tensor 

We now turn to the extrinsic elements on the surface. The primary extrinsic object is 
the curvature tensor B a p (also known as the extrinsic curvature tensor ) introduced 
in Chap. 1 1 in equation (11.16): 


V«S/, = N B aP . (11.16) 

In component form, this equation reads 

VaZ^j = N’ B a p. (12.32) 

The combination N' B® is called the curvature normal. The vector invariant N B“ 
may be referred to as the vector cun’ature normal. 

Equation (11.16) can be written in the following elegant way 

V a V /j R = N B aP . (12.33) 

Thus, the surface Laplacian of the position vector R yields the vector curvature 
normal of the surface: 


V a V"R = N5“. 


(12.34) 
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Contract both sides of equation (12.32) with Nj to obtain an explicit expression 
for B a p\ 


B a p = (V, V„Z' . (12.35) 

Recall that B a p depends on the choice of normal. The combination N' B a p is 
independent of that choice. 

Exercise 257. Show that B a p is symmetric 

B a p = Bp a . (12.36) 

The trace Bfi of the curvature tensor B“, is the mean curvature. Another important 
invariant associated with the curvature tensor is the determinant | B' of Bp. The 
trace B " and the determinant | B. | are the two invariants associated with the tensor 
B^ in the linear algebra sense of similarity transformations in a two-dimensional 
space. These invariants can be expressed in terms of the eigenvalues K\ and at? of 
Bp which are real since B a p is symmetric. The quantities at and at are called the 
principal cur\’atures of the surface. Their geometric interpretation is discussed in 
Chap. 13 on curves. In terms of the principal curvatures, B“ and B' are given by 

B“ = Ki+ k 2 (12-37) 

\B:\=kik 2 . (12.38) 

Equation (12.37) is the reason behind the word mean in mean curvature. Some 
texts define the mean curvature as the average (at + k 2 ) /2. This definition certain 
advantages over our definition in a few situations. For example the mean curvature 
of a sphere according to our definition is —2/R. In the alternative definition, the 
value is — 1 / R, which may be more consistent with our intuition of curvature. 
However, we prefer our definition since in numerous applications, and particularly 
in the calculus of moving surfaces, it is the quantity B“ (rather than 6“ /2) that is 
most frequently encountered. 

The metric tensor S a p is sometimes referred to as the first groundform of the 
surface and the curvature tensor B„p is the second groundform. There is also the 
third groundform Cp defined as the “matrix square” of the curvature tensor 

C a p = B a y B y p. (12.39) 

The third groundform is encountered frequently in applications. 

The extrinsic nature of the curvature tensor can be illustrated by forming a 
cylinder out of a sheet of paper. When the sheet of paper is flat, the curvature tensor 
B a p vanishes since V a Sp does not have a normal component. When the sheet is 
curved into a cylinder without stretching and each material particle is allowed to 
keep its coordinates, the metric tensor and its derivatives are unchanged. This is 
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because the metric tensor can be calculated from the unchanged lengths of curves 
embedded in the surface. Meanwhile, the curvature tensor does change: the mean 
curvature of the cylinder with respect to the exterior normal is — 1 / R. Therefore, 
the curvature tensor depends on the way in which the surface is embedded in the 
ambient Euclidean space. 

Exercise 258. Show that the following relationship holds at points of zero mean 
curvature: 


b;bp = -2\b:\. 


(12.40) 


12.5 The Calculation of the Curvature Tensor for a Sphere 


As an illustration, we calculate the curvature tensor for a sphere of radius R. In 
Chap. 10, we calculated the shift tensor, the metric tensors and the Christoffel 
symbols for a sphere referred to Cartesian coordinates in the ambient space and 
the standard spherical coordinates on the surface. The identity 

B at ; = -ZjvVpN 1 (12.41) 


is a form Weingarten’s formula (11.50) and offers a convenient way for evaluating 
the curvature tensor. The advantage of equation (12.41) is that it does not require the 
use of the surface Christoffel symbol and, additionally, does not produce any “three- 
dimensional” objects such as V„Z^ which cannot be represented by matrices. Recall 
that Z ia is given by the matrix 


M\ 


R cos 9 cos cp 
R cos 9 sirup 
-R sin 9 


—Rsin9 sirup 
R sin 9 cos <p 
0 


The object N' = 3 N' / dS & is given by the matrix 


(12.42) 


M 2 


cos 9 cos cp 
cos 9 sin (p 
— sin 9 


— sin 9 sirup 
sin 9 cos (p 
0 


(12.43) 


Therefore, the curvature tensor B a p is represented by the matrix product —M '[ M 2 


Bufi 


—R 0 
0 -R sin 2 9 


(12.44) 


The curvature tensor /i" is obtained by contraction with the contravariant metric 
tensor S in equation (10.97) 


a 

p 


0 

0 -i 


B 


(12.45) 
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Thus, the mean curvature B“ is 


B 


a. 

a 


2 

R' 


Finally, the contravariant curvature tensor B U ' J ' is given by 


B aft = 


t 



0 

t 

Rl sin 2 9 


12.6 The Curvature Tensor for Other Common Surfaces 

The cylinder ( 10. 101 )-( 10. 103) 

Rafi 

S“ 


'-R O' 

; Ba = 

' 4 ®' 

; B aP = 

1 

0 

-IBs 

1 

1 

0 0 

’ p 

00. 


00_ 


1 

R 


Cone with angle 0 : 


B u 


r cos 0 sin 0 0 

; B a R = 

" _ cot(9 q “ 
r 

; B af> = 

0 0 

’ p 

0 0 



COS 0 
r 3 sin 3 0 

o 


B a = - 


cot0 


The torus (10. 107)-(10. 109): 

Raft — 
B% = 


■ (R + r cos <p) cos(f> 0 
0 — r 


COS 0 

R+r cos 0 


B 


afi 


COS0 


( R+r cos (j ) ) 


o 


b:: 


0 — r 

R + 2r cos (f> 


(12.46) 

(12.47) 

(12.48) 

(12.49) 

0 

0 

(12.50) 

(12.51) 

(12.52) 

(12.53) 

(12.54) 


r (R + r cos </>) 


(12.55) 
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The surface of revolution (10. 1 15)— (10. 117): 



Vl +r'(z ) 2 _ 


(12.56) 



(12.57) 



(12.58) 


(12.59) 


Exercise 259. Show that the catenoid given by r (z) = a cosh (z — b) /a has zero 
mean curvature. 


12.7 A Particle Moving Along a Trajectory Confined 
to a Surface 

This section parallels Sect. 8.9, in which we analyzed the motion of a particle along 
a trajectory in a Euclidean space. The present problem is substantially richer since 
the acceleration of the particle depends on the curvature tensor of the surface. The 
content of this section is presented in the form of exercises. 

Exercise 260. Suppose that a particle confined to the surface moves along the 
trajectory y ( t ) given by 


S a = S a ( t ). 


(12.60) 


Show that its velocity V is given by 


V = E“S a 


(12.61) 


where 



(12.62) 


dt 


Hint: R(t) = R(S (?))■ 
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Exercise 261. Show that the acceleration A of the particle is given by 

gV°> 

A = — S a + NB ap V a V p , (12.63) 

ot 

where 

&V 01 dV a „ a „ 

TT = TiT + ^ vfrr - {12M) 

Conclude that 8V a /St is a tensor with respect to coordinate transformations on the 
surface. Also note the term N B a « V a V ,P ' known as the centripetal acceleration. 

Exercise 262. Use this problem as an opportunity to develop a new S/S t -calculus 
motivated by the definition (12.64). For a general tensor 7J, define the 5/St- 
derivative along the trajectory as follows 


ST? dT% 

-± = ^t + v y t; w t; - v y r“pT“. (12.65) 


This derivative is also known as the intrinsic derivative. 

Exercise 263. Show that the 8/ St -derivative satisfies the tensor property. 
Exercise 264. Show that the 8/ St -derivative satisfies the sum and product rules. 
Exercise 265. Show that the 8/ St -derivative satisfies the chain rule 


57“ 

IT 


ST- (t, S) 
3 1 


+ V y S7 y T$ 


( 12 . 66 ) 


for trajectory restrictions of time-dependent surface tensors 7J. 

Exercise 266. Conclude that the 8/S t -derivative is metrinilic with respect to all the 
surface metrics, except $„ and S“. 

Exercise 267. Show that 

= N V^B Ya . (12.67) 

ot 

Exercise 268. Show that 8 /St commutes with contraction. 

Exercise 269. Use the newly developed calculus to rederive equation (12.63) and 
to derive the following equation for the jolt J: 


J = 


S 2 V a 

~si r 


b;b yS 


V^V Y V' 



sv a \ 

— — + y a Bp Y V a V^V y ) N. 


(12.68) 
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12.8 The Gauss-Codazzi Equation 


Apply the general commutator equation (12.18) to the shift tensor Z l 

v a v*z‘ -v*v„z‘ =-*^zj. 


(12.69) 


By equation (12.32), we find 


v a (A TiBfiy) - Vp (N‘B aY ) = -R s ya pZ' s . 


(12.70) 


Applications of the product rule and Weingarten’s formula (11 .50) yield 


- Z\B s a Bpy + Z‘ s B s fi B ay + N'V a Bp Y - N‘VpB ay = -R^pZf (12.71) 


This equation is called the Gauss-Codazzi equation. Its normal projection is known 
as the Codazzi equation and its tangential projection is the celebrated Theorema 
Egregium discussed in the following section. 

Contract both sides of equation (12.71) with the normal A) to obtain the Codazzi 
equation: 


V a Bpy ~ VpBcty = 0 . 


(12.72) 


The Codazzi equation states that V a Bp y is symmetric with respect to a and /I 


V a Bpy — V p B ay . 


(12.73) 


Since the curvature tensor is symmetric, the Codazzi equation tells us that V„ Bp y is 
fully symmetric with respect to its indices. 


12.9 Gauss’s Theorema Egregium 

We have arrived at one of the most beautiful relationships in differential geometry: 
Gauss’s Theorema Egregium, translated from Latin as the Remarkable Theorem. It 
is contained as the tangential component in equation (12.71) and can be revealed by 
contracting both sides with Zf (and subsequently renaming o> )'): 



(12.74) 


This celebrated equation is also known as the Gauss equation of the surface. It 
usually appears in the form with lowered indices: 
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In Gauss’s own words: “Thus, the formula of the preceding article leads itself to 
the remarkable Theorem. If a curved surface is developed upon any other surface 
whatever, the measure of curvature in each point remains unchanged.” 

Theorema Egregium makes the following powerful statement: the combination 
Bay Bps — Bp y B a s, being the Riemann-Christoffel tensor, is intrinsic. That is, in 
can be expressed in terms of the covariant metric S a p and its derivatives. In effect, it 
means that the object B ay Bps — Bp y B a $ can be calculated by measuring distances 
on the surfaces. 

We will now derive several elegant equivalents of Theorema Egregium. Recall 
equation (12.28), in which the Riemann-Christoffel tensor is expressed in terms of 
the Gaussian curvature K 


BySaP — Ks a pE y 8. 


(12.28) 


Equation (12.28) yields 


BayB ps Bp y B a g K8 a p8y8 , (12.76) 

as well as an explicit formula for the Gaussian curvature in terms of the curvature 
tensor 


K = Ig-V* (B ar Bpt - BpyBas) . (12.77) 

Multiply the right-hand side of (12.77) and express the Levi-Civita symbols in terms 
of the permutation systems e and e yS : 

K = ^e^e yS B a yBp S - ^ e ^e yS Bp y B aS . (12.78) 

One easily recognizes that each of the two terms on the right-hand side equals 
(2S) -1 \B..\, where II. denotes the determinant of the curvature tensor B a p. 
Therefore, 


K = S~ l ]5..| . (12.79) 

Since is the determinant of the contravariant metric tensor 5“^, we find 


K = |S"| |£..| . (12.80) 

By the multiplication property of the determinant (9.28), we arrive at the ultimate 
expression of Theorema Egregium: 


k=\b:\. 


(12.81) 
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In words, Gaussian curvature equals the determinant of the curvature tensor Bp. In 
particular, Gaussian curvature equals the product of the principal curvatures 

K = KiK 2 . (12.82) 

Now, consider another way of manipulating equation (12.76) 

Bay B fi8 B fly B a § Ks a pSy§. (12.76) 

Raise the indices a and /) and contract a with y 

B a a B% - B^Bf = Ks^sas. (12.83) 

Since s a ^s a s = Sg , we find 

= K8$. (12.84) 

This new form of Theorema Egregium relates the first, second, and third ground- 
forms of the surface. 


Cap = H B afi - K S afi . (12.85) 

Exercise 270. Show that the Gaussian curvature of the cylinder (10.101)- 
(10.103) is zero: 

K = 0. (12.86) 


Exercise 271. Show that the Gaussian curvature of a cone is zero: 

K = 0. (12.87) 

Exercise 272. Show that the Gaussian curvature of the sphere (10.92)-(10.94) is 


1 

K = lv-- 


(12.88) 


Exercise 273. Show that the Gaussian curvature of the torus (10.107)-(10.109) is 

cos (j) 


K = 


r (R + r cos < 


(12.89) 


Exercise 274. Show that the Gaussian curvature of the surface of revolution 
(10. 1 15)— (10. 117) is 

r"{z) 

r (z) (l + r'{zf} 


K = - 


(12.90) 
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12.10 The Gauss-Bonnet Theorem 


The Gauss-Bonnet theorem is a fundamental result in differential geometry and 
topology. It states that for a closed surface, the total curvature, defined as the integral 
of the Gaussian curvature K, depends on the genus of the surface and not its shape. 
The genus g of a surface is the number of topological holes. For example, the genus 
of a sphere is zero and the genus of a torus is one. According to the Gauss-Bonnet 
theorem, the total curvature is 4 jt (1 — g): 


L 


KdS = An (1 — g) . 


(12.91) 


In particular, the total curvature is A it for any surface of genus zero and 0 for any 
surface of genus one. In Chap. 17, we discuss the Gauss-Bonnet theorem in greater 
detail and give a novel proof based on the calculus of moving surfaces. 

Exercise 275. Verify the Gauss-Bonnet theorem for a sphere. That is 


Exercise 276. 


L 


KdS = Art. 

Verify the Gauss-Bonnet theorem for a torus. That is 
KdS = 0. 


L 


(12.92) 


(12.93) 


12.11 Summary 

This chapter, devoted to curvature, was one of the most exciting to write. Curvature 
is not only an intriguing geometric concept, but also the source of most of the 
analytical challenges that make tensor calculus an indispensable tool of differential 
geometry. Armed with the tensor calculus framework, we had hardly any difficulties 
with any of the analytical calculations. This is truly a tribute to the power of 
the tensor technique. Interestingly, all of the results discussed in this chapter 
were known to Gauss and Riemann. And these great mathematicians did not 
have contemporary techniques at their disposal, which makes their achievements 
all the more remarkable. 


Chapter 13 

Embedded Curves 


13.1 Preview 

In this chapter, we apply the methods of tensor calculus to embedded curves. In 
some ways curves are similar to surfaces and in some ways they are different. 
Of course, our focus is on the differences. When embedded in Euclidean spaces 
of dimension greater than two, curves are not hypersurfaces and therefore do not 
have a well-defined normal N and curvature tensor B a p . Furthermore, a number of 
interesting features of curves can be attributed to their one-dimensional nature. For 
example, curves are intrinsically Euclidean: they admit Cartesian coordinates (arc 
length) and their Riemann-Christoffel tensor vanishes. Other properties that stem 
from the curves’ one-dimensional nature are captured by the Frenet formulas. 

We also consider curves embedded in two-dimensional surfaces which are, 
in turn, embedded in the ambient Euclidean space. Curves embedded in two- 
dimensional surfaces are hypersurfaces with respect to that embedding. Therefore, 
much of our discussion of general hypersurfaces carries over to curves embedded 
in surfaces. However, the ambient surfaces are not Euclidean! This gives us an 
opportunity to discuss elements of differential geometry from a Riemannian point 
of view. 


13.2 The Intrinsic Geometry of a Curve 

Curves are intrinsically Euclidean. In analytical terms, the Riemann-Christoffel 
tensor vanishes for all curves. While we demonstrate this below, it is also clear 
from geometric intuition. A curve can be parameterized by its arc length, which is 
equivalent to introducing a Cartesian coordinate system. To convince yourself of 
this, imagine the curve as an inextensible string with etched-in arc length coordi- 
nates. Suppose now that the string is pulled into a straight line. Since the string is 
inextensible, no part of it is stretched or compressed. Thus the transformation was 
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isometric — in other words, all distances along the curve are preserved. Now that the 
string is straight, it is clearly a Euclidean space referred to Cartesian coordinates. 
However, to an intrinsic observer only capable of measuring distances along the 
curve, nothing has changed. Therefore, the original curve is also Euclidean. 


13.3 Different Parametrizations of a Curve 

Curves are parametrized by a single variable often taken to be the (signed) arc 
length s. Most, if not all, classical texts approach curves in this way. The arc length 
approach has a number of advantages, not the least of which is the clear geometric 
interpretation. For example, the quantity 


dR(s) 

ds 


(13.1) 


is the unit tangent. Could there be a more natural object associated with a 
curve? However, the arc length approach goes very much against the spirit of 
our subject. We have passionately advocated against choosing special coordinate 
systems in deriving general results. The reasons against doing so were outlined 
for general Euclidean spaces in Chap. 3. Those reasons remain valid for curves. 
More practically, what does one do in situations where arc length is an inconvenient 
parameter? In fact, for most curves, it may be quite difficult to come up with the arc 
length parametrization. For example, consider the ellipsoidal spiral 


x (t) = A cos t 

(13.2) 

y (t) = B sint 

(13.3) 

z(t ) = Ct. 

(13.4) 


To come up with an arc length parameterization, one would need to calculate the 
integral 


s (t) = f \l A 2 sin 2 1' + B 2 cos 2 1' + C 2 dt ' , (13.5) 

Jo 

and then invert the resulting function of t to obtain t (s). Clearly, this is not a 
desirable approach in most practical applications. 

We therefore refer the curve to an arbitrary coordinate system. The coordinate 
variable is denoted by I/ 1 . Despite the fact that there is a single dimension, we 
add an index to the independent variable in order to preserve the tensor notation. 
Furthermore, we use capital Greek letters to represent the value 1. Thus, U 1 may be 
referred to as U® or U 9 . This allows us to draw a close parallel between curves and 
surfaces and to reap full benefits of the tensor calculus framework. 


13.4 The Fundamental Elements of Curves 
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Refer the ambient space to coordinates Z' and that the embedded curve to the 
coordinate (7®: 


Z' = Z' (U) . 


(13.6) 


The shift tensor Z$ is obtained by partial differentiation 


Z 


4 > — 


dz 1 ' 

du*' 


(13.7) 


As with embedded surfaces, a variant is considered a tensor if it changes according 
to the tensor rule with respect to changes of both coordinate systems. 

The covariant basis U®, consisting of a single vector, is defined by 


U® 


3R(C7) 

9(7® 


(13.8) 


and is related to the ambient basis Z, by contraction with the shift tensor 


u® = z ,z 


i 

<D* 


(13.9) 


The covariant metric tensor (7®®, a single number which could be thought of as a 
lxl matrix, is defined by the dot product 


{7®® = U®-U®. (13.10) 

It is related to the ambient metric tensor Z,y by a double contraction with the shift 
tensor: 

U 99 = Z ij Z\ b Z J 9 . (13.11) 

The contravariant metric tensor (7®® is the “matrix” inverse of (7®®: 

C/ 0 ®E/‘ m ' = &%. (13.12) 

Finally, the contravariant basis U® is given by 

U ® = (7® lI 'U®. (13.13) 


The object U, defined as the determinant of the covariant metric tensor 
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is a relative invariant of weight 2. It is given explicitly by the formula 


U = 


dZ' dZJ 
ZiJ dU T dU T ' 


(13.14) 


The length element or the line element \flj is the square root of U , and it is a relative 
invariant of weight 1 . The length of curve s corresponding to the coordinate ranging 
from a and b is given by the integral 


s = 


f 


VUdU. 


This equation was derived in Sect. 5.8. 

The Levi-Civita symbols and e® are defined by 


£<|) 


£ 


<J> 


Vu 

i 

Vu' 


(13.15) 


(13.16) 

(13.17) 


The line element is a relative tensor of weight 1 when treated as a variant of 
order zero. But it is an absolute tensor when treated as a variant of order one. 

In Chap. 9, we discussed the Levi-Civita in two or more dimensions. Equations 
(13.16) and (13.17) are a generalization to one dimension. For curves, the Levi- 
Civita symbol loses its skew-symmetric characteristics. Nevertheless, it remains 
a very useful object. For example, the combination is an invariant. It must 

therefore be the unit tangent. 

Exercise 277. Explain why is the unit tangent. Hint: Consider U l = s. 

The Christoffel symbol rj 1 ^ is defined by 




IT 


3U® 

W*' 


The Christoffel symbol is expressed alternatively by 


(13.18) 


r 


© _ 


■V 

3t7* 


■ u®. 


(13.19) 


In terms of the metric tensor, the Christoffel symbol is given intrinsically by the 
usual identity 


r e _ It7©£2 ( dUn<t> 

2 V ^ + dU* 


9LW \ 

dU Q ) ' 


(13.20) 
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When a curve is parameterized by the arc length s, the values of the fundamental 
objects are consistent with the Cartesian nature of such parameterization. Namely, 
with respect to the arc length s. 


Uw = [1] ; U™ = [1] ; rf* = 0. (13.21) 


13.5 The Covariant Derivative 


The covariant derivative V® applied to a tensor TTjJ 1 with a representative collection 
of indices is entirely analogous to the surface covariant derivative: 


3 W: 


VeWj* = ^JW + - z% r™ wi% + T$ a wj$ - r g 9 wj*. (13.22) 


This covariant derivative V® has all of the same properties as the surface covariant 
derivative described in Chap. 1 1 . In particular, it produces tensor outputs for tensor 
inputs, satisfies the sum and the product rules, commutes with contraction and obeys 
the chain rule for curve restrictions Wj of ambient variants 

v®w; = Z k V k W‘. (13.23) 


There is nothing that can be said about the covariant derivative V® that was not 
said in Chap. 1 1 on the covariant surface derivative V a . The definition (13.22) does 
not rely on the one-dimensional nature of curves and is therefore valid for subspaces 
of arbitrary dimension. We highlight an important difference: since a curve is not 
a hypersurface with respect to the ambient space and there is no a priori normal 
direction, the quantity cannot be used to define a curvature tensor 5$^ . Thus, 

our approach to analyzing curvature must be modified. This is accomplished in the 
next section. 

We can take advantage of the one-dimensional nature of curves and define a new 
differential operator that produces tensors of the same order as the argument. The 
new operator V s is defined by the combination 


V s 


e 0 V® 


(13.24) 


that is 




s & v @ w;l 


(13.25) 


The following exercises describe the analytical properties of this operator. 
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Exercise 278. Show that V v coincides with d /ds for invariants 

dW 

W S W=—. (13.26) 

ds 

Exercise 279. Show that V v satishes the sum and product rules. 

Exercise 280. Show that V v commutes with contraction. 

Exercise 281. Show that V v satishes the following chain rule for curve restrictions 
of ambient variants 

V s Wj =e & Z k & V k Wj. (13.27) 

Therefore, 

V s Wj = T k V k W‘, (13.28) 

where T k is the contravariant component of the unit tangent. 

Exercise 282. Show that 

V 2 = VoV 4 ”. (13.29) 


13.6 The Curvature and the Principal Normal 

As was the case for hypersurfaces, the object does not vanish. Instead, it is 

orthogonal to the tangent space: 


U® • VoLKj, = 0. (13.30) 

Derivation of this equation is analogous to that of equation (11.15). In the case of 
hypersurfaces, we argued that since VcjUip is orthogonal to the tangent plane, it 
must point along the normal N . For curves, there is no unique normal direction. 
For curves embedded in the three-dimensional Euclidean space, the normal space at 
each point on a curve is a plane. In general, the dimension of the normal space equals 
the co-dimension of the curve. Therefore, we are not able to represent V,j,U^ as a 
product of an invariant vector and a tensor with scalar elements. We refer to this 
object as the vector curvature tensor and denote it by B$q<: 


= V*U*. (13.31) 

For curves, the vector curvature tensor consists of a single vector. The indices 
continue to play their important role: they remind us that is not an invariant 
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vector and indicate how changes under a reparametrization of the curve. In 
order to obtain an invariant, one of the indices must be raised and contracted with 
the other to yield B|. The invariant B| is called the curvature normal. 

The curvature normal B| can be viewed from two complementary points of view. 
If the curve is interpreted as a general embedded surface, the curvature normal is 
seen as the surface Laplacian of the position vector R: 


B® = V^V^R. 


(13.32) 


This interpretation is an immediate consequence of the definition in equation 
(13.31). This is the interpretation of Chap. 12. Alternatively, if the emphasis is on 
the one-dimensional nature of the curve, B| can be represented as 



(13.33) 


where the invariant derivative V, is defined in equation (13.24) and the equivalence 
of (13.32) and (13.33) follows from equation (13.29). The latter interpretation is 
particularly important in the following section on Frenet formulas. 

At points where the vector B® does not vanish it can be represented as a product 
its length k, called the curvature , and a unit vector P called the principal normal P: 



(13.34) 


This concept of curvature is analogous to the mean curvature of a hyper surface. 
However, the curvature k is always nonnegative and the principal normal P is only 
defined at points with positive curvature k. By contrast, the mean curvature B“ of a 
hypersurface can be positive or negative and the normal N is defined everywhere. 
These differences are illustrated in Fig. 13.1, which shows a curve that can be 
considered either as a hypersurface in the plane 


x ( t ) = t 
y (t) = sin t 


(13.35) 

(13.36) 


and as a curve in the three-dimensional space 


x (f) = t 


(13.37) 

(13.38) 

(13.39) 


y ( t ) = sinf 


z (?) = 0 . 


In the former case, the normal A is a continuous vector field and the mean curvature 
is given by 
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Fig. 13.1 The first two plots show a sinusoid viewed as embedded in the two-dimensional plane. 
With respect to this embedding, the sinusoid is a hypersurface and has a well-defined continuous 
unit normal field N. The other two plots show the sinusoid embedded in the three-dimensional 
space. The curvature normal B" is a smooth vector field, while the principal normal P is only 
defined at points of nonzero curvature 


In the latter case, the principal normal P is piecewise continuous and is undefined at 
points of zero curvature. Also, when the curve is interpreted as a hypersurface, the 
direction of the normal can be arbitrarily chosen, while the direction of the principal 
normal P is unique. 


13.7 The Binormal and the Frenet Formulas 


The focus of the preceding sections was on the elements that curves share with 
surfaces of higher dimensions. The focus of this section is on the one-dimensional 
nature of curves. We start with the position vector R and repeatedly apply the 
invariant derivative V s . Since, according to equation (13.26), V v is equivalent to 
d / ds when applied to invariants, we may as well refer the curve to the signed arc 
length s and use the ordinary derivative d/ds. Keep in mind, however, that, for 
invariants, d/ds is equivalent to e^V®, which means that the entire discussion can 
be easily translated to arbitrary coordinates. 

The unit tangent vector T is obtained by applying d / ds to the position vector T: 


d R 

ds 


= T 


(13.41) 


An application of d /ds to T yields the curvature and the principal normal: 


d T 

ds 


= K P, 


(13.42) 
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where the principal normal P is unit length and orthogonal to the tangent T: 

(13.43) 

(13.44) 


P P= 1 
P T = 0. 


Exercise 283. Differentiate the identity T ( 5 ) • T (s) = 1 to conclude that P is 
orthogonal to T. 

Applying d/ ds to equation (13.43): 


and to equation (13.44): 



d P 

— -T + k = 0. 
ds 


The last equation can be written as 


(13.45) 


(13.46) 


( 


dP 

~ds +K 


■ T = 


0 


(13.47) 


which shows that the vector 


— +kT (13.48) 

ds 

is orthogonal to T. Furthermore, from equation (13.45) it follows that this vector is 
also orthogonal to P. Denote this vector by rQ: 

dP 

— +kT= tQ, (13.49) 

ds 

where Q is a unit vector and the sign of r is chosen such that the triplet T, P, Q 
forms a right-handed set in the following sense 

s ijk T i P J Q k = 1. (13.50) 

The quantity r is called the torsion and the vector Q is called the binormal vector 
or simply the binormal. 

The binormal Q can be expressed as the vector product of the tangent and the 
principal normal 


Q = T x P. 


(13.51) 
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Applying d /ds to both sides of the equation, we find 


d Q 

ds 



(13.52) 


or 


- = PxP + Tx (-kT + rQ) . 


(13.53) 


Since T x Q = — P, we have 


The combination of equations (13.42), (13.49), and (13.54), summarized here: 



(13.55) 


(13.56) 


(13.57) 


are called the Frenet formulas. 

Exercise 284. Show that the vector dP/ds + k T is obtained from dP/ds by the 
Gram-Schmidt orthogonalization procedure. 

Exercise 285. Calculate the principal normal, binormal, curvature, torsion for the 
spiral 


x (?) = 4 cos? 
y ( t ) = 4 sin? 
z(t) = 3 ?. 


(13.58) 

(13.59) 

(13.60) 


Exercise 286. Repeat the calculation for the spiral 


x (?) = 4 cos? 
y (?) = 4 sin? 
z(t) = -3?. 


Note which answers change for this spiral. 


(13.61) 

(13.62) 

(13.63) 
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13.8 The Frenet Formulas in Higher Dimensions 

When the Frenet formulas C 1 3.55)— ( 13.57) are written in matrix form 


r dT -| 

ds 


K 


"T" 

dP 

ds 

= 

—K T 


P 

dQ 

- ds - 


— T 


_Q_ 


and we are struck by the skew symmetric property. This is part of a larger pattern. 

Suppose that a curve is embedded in an ;V -dimensional Euclidean space. At a 
given point on the curve, construct an orthonormal basis according to the following 
procedure. The initial vector T 0 is taken to be the unit tangent T. Subsequently, 
T„, is obtained is obtained by applying a single step of Gram-Schmidt algorithm to 
dT m -\/ds to make it orthogonal to each of the preceding vectors and factoring out 
a scalar multiple to make it unit length. 

In the preceding section, we examined the first two steps of the procedure. The 
vector dTo/ds is automatically orthogonal to To, since To (.v) is unit length for all s. 
Thus, in the context of the Gram-Schmidt procedure, it only needs to be normalized 
to produce T i : 


dTo 

T b (13.65) 

ds 

Using the terminology of the preceding section, T[ is the principal normal P and K\ 
is the curvature k. Thus, Ti satisfies the following identities 


Ti • To = 0 

Ti-T, = 1. 


Differentiating the first identity we find 


d Tj 
ds 


T 0 + T! 


d To 
ds 


which, by equation (13.65), gives 


d Ti 
ds 


To 


~K\. 


(13.66) 

(13.67) 


(13.68) 


(13.69) 


Differentiating the second identity yields the familiar orthogonality condition 
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Thus, dTi/ds is orthogonal to Ti. Therefore, the Gram-Schmidt formula 
includes the term that corresponds to To and a coefficient K 2 chosen so that T 2 
is unit length: 


K2T2 = 



Therefore, by equation (13.69), we have 


K 2 T 2 = 


d T, 
ds 


+ kiTo, 


or 


dT\ 

ds 


K2Y2 — tfiTo. 


(13.71) 


(13.72) 


(13.73) 


We recognize this relationship as equation (13.49), where T 2 is the binormal Q and 
K 2 is the torsion r . 

We now embark on the inductive step of the procedure. The unit vector T m is 
orthogonal to all preceding vectors T/ f <m : 


T m • T 0 = 0 (13.74) 

T,„ ■ Ti = 0 (13.75) 


T/n • T k = 0 


(13.76) 


T • T 


1 . 


(13.77) 


Furthermore, presume that up to k = m — 1 , we have 
d T* 


ds 


= Kk+lTk+l ~ KkTk-1- 


which we demonstrated for k = 0 and 1 . 

Differentiate equation (13.76), valid for k < m, with respect to s: 

dT m dTk 

-j ~ ■ T, + T m • — ^ = 0. 
ds ds 

Substitute equation (13.78) into equation (13.79): 

dT 


1 T k + Kk+lTm • Tfc+l — KkTm ' T k- 


0. 


(13.78) 


(13.79) 


ds 


(13.80) 
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The dot product T,„ • T*_i vanishes for all k < m. Meanwhile, the dot product 
T„, • T/ f + 1 vanishes for all A: < m — 1 and equals 1 for k = 77? — 1. Therefore, 
equation (13.80) tells us the following 


d T m 
ds 


■ Tt = 0, for k < m — 1 


d T m 
ds 


T m - 1 — K 


m 


(13.81) 

(13.82) 


In other words, dT m /ds is orthogonal to To, .... T,„_ 2 , but not T,„_i . The fact that 
dT m /ds is orthogonal to T m follows from equation (13.77). Therefore, the Gram- 
Schmidt formula always contains a single term 


rp dT m ( dT m rj . 

^ 777+1 - 1 / 77+1 — , ( 7 1 / 77 — 1 J A yyi — J , 


_ / d T,n \ 

v ds ' m_1 j m_1 


\ ds 

which, with the help of equation (13.82) reads 

f d T,„ | r 

+ 1 T m -f | — - “(“ K m T w _ 1 . 

ds 

This equation can be written as 

d T„ 




+ /TmT m — | . 


(13.83) 


(13.84) 


(13.85) 


and the proof by induction is complete. 

Exercise 287. Explain why the last equation in the Frenet procedure reads 

— = -tov-iTtv-2, (13.86) 

ds 

consistent with equation (13.57). 


13.9 Curves Embedded in Surfaces 

Leonhard Euler’s original investigation of surface curvature was based on analyzing 
curves embedded in surfaces. Euler’s analysis focused on intersections of surfaces 
with planes since curves embedded in planes was well understood. 

We have so far analyzed two different types of embedded manifolds: hypersur- 
faces and curves embedded in a Euclidean space. We are about to analyze a new 
combination — curves embedded in surfaces. This configuration has a fundamentally 
new characteristic: the ambient space is not Euclidean. This is a typical situation 
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in Riemannian geometry, where the ambient space provides a metric from which 
the differential elements of the embedded manifold are derived. Therefore, the 
discussion contained in this section is essentially an example of Riemannian 
analysis. 

Suppose the surface is given by the equation 

Z ! = Z' (5) , (10.3) 

and the embedded surface is given by 

S a = S a (U), (13.87) 


as before. We first discuss the embedding of the curve in the surface. This is the 
Riemannian portion of our analysis. While many of the equations are identical to 
previously encountered relationships, their interpretation is different. 

The shift tensor 5$ is given by 


35“ (U) 
® dU® 

The covariant metric tensor [7$^ is defined by 

Uw = s a pS% 


(13.88) 


(13.89) 


Despite the outward similarity of equations (13.89) and (13.11), their interpretations 
are completely different! Equation (13.11) is not a definition of the metric tensor 
but rather a consequence of the definition (13.10). A definition based on the 
covariant basis is not possible here, since we are considering embedding the curve 
with respect to the surface and ignoring the larger ambient Euclidean space. 

The next few steps follow a familiar pattern. The contravariant metric tensor 
is the matrix inverse of U^: 


U^Uyn, = 8%. (13.90) 

From equation (13.89), we find 

£/*# = (13.91) 

and 

Sp 5^ = St (13.92) 

The Christoffel symbol Tj 1 ^ also cannot be defined in the extrinsic fashion, as 
was done in equation (13.18). We must instead accept the intrinsic definition 
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r © _ 1 T T en ( du nf . du ^ 

** 2 \ at/* + at/® 


3 (Jrjity \ 

dU n ) ' 


(13.18) 


The curve Christoffel symbol T®^ is related to the surface Christoffel symbol T^ r 
by the relationship 


r) *\ a 

-po; c< oF I $ 

A c&tj/ ’ 


(13.93) 


The definition of the covariant derivative reads 


V T’aQ 


ar“® 

P'V I o/ -pa rpco* i> 
“T °© A yco 1 PV 


d u & 


r»y ptw 'T’dQ _i_ p<J> '-ra.Q. pf2 t-'Q'O 

°0 A yP^coV 1 A 0'I' i £ft > 


(13.94) 


This derivative satisfies all the familiar properties. 

The length element or line element yfU is defined as the square root of the 
determinant of the lxl covariant metric tensor U^q,: 

VU = y/\U7\. (13.95) 


The Levi-Civita symbols £$ and £® are defined by 


£<j> 


£ 




Vu 

l 

Vu" 


(13.96) 

(13.97) 


The curve, being a hypersurface with respect to an ambient surface, has a well- 
defined normal /?“ given by 

n a = e^e^Sf. (13.98) 

We have already seen a definition analogous to this one in Sect. 10.7, where the 
normal was defined for curves embedded in two-dimensional planes. Since in this 
definition, nothing relies on the flatness of the ambient space, it remains valid for 
hypersurfaces embedded in Riemann spaces. The normal n a is unit length 

n a n a = 1 (13.99) 

and is orthogonal to the tangent space 

n a S% = 0. (13.100) 

We next the curvature tensor b <j»p according to the equation 


= b^n 01 . 


(13.101) 
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This equation is identical to (1 1 .42) which, in turn, is an immediate consequence of 
the definition (11.16). However, the basis for equation (11.16) was the fact that 
V,ySp is orthogonal to the tangent space spanned by Sq, and thus colinear with 
N. If you recall, the demonstration of that fact relied on the extrinsic definition 
(10.80) of the Christoffel symbol. Therefore, in order to use equation (13.101) as the 
definition of curvature, we must show that is orthogonal to the tangent space 

by using the intrinsic definition (13.18) of the Christoffel tensor or the equivalent 
equation (13.93). 

The analytical expression for being orthogonal to the tangent space reads: 

S a ®V*S| = 0. (13.102) 


Let us expand the covariant derivative V ( j< 

ds, 


Y7 c<x , (?Pj^a qY 7-1 £2 <?a 

and contract both sides with 5® 

C©Y7 o© i c & cfi cy c 0 ' C 

V iJ/O* — >J„ „ TT ,„ T 1 # v O„ 1 (bdl^O A 


dU* 


(13.103) 


(13.104) 


According to equation (13.93), the first two terms produce T^,. The last term yields 
— rf® since S^S® = 8 ^ and equation (13.102) is confirmed. 

We are therefore able to accept equation (13.101) as the definition of the 
curvature tensor b^. The invariant 


b%, (13.105) 

is called the geodesic curvature. Of course, geodesic curvature is the same as 
the mean curvature. However, the term geodesic curvature is applied to one- 
dimensional curves while mean cun’ature is applied to hypersurfaces in higher 
dimensions. 

At a particular point on a curve we have three curvature tensors in play: the 
curvature tensor B“ of the host surface, the curvature tensor characteristic of the 
curve’s embedding in the surface, and the vector curvature tensor B$vp characteristic 
of the curve’s embedding in the ambient Euclidean space. These three tensors and 
linked by an identity that we are about to derive. 

Express the equation of the curve (13.6) as a composition of the equation of the 
surface (10.3) and the equation of the curve (13.87) within the surface 

Z' (U) = Z' (S ((/)) . (13.106) 

Differentiating this identity with respect to U 41 yields a relationship among the three 
shift tensors Z ' rt) . Z' y , and S%: 


13.9 Curves Embedded in Surfaces 


231 


Z* = KSl- (13-107) 

The vector form of this equation is obtained by contracting both sides of this 
equation with the ambient covariant basis Z, : 

U $ = S„S£. (13.108) 

Apply the covariant derivative to this identity: 

V^U® = V*S„ SI + S (13.109) 

The combination on the left-hand side is B®®. The tensor V®S a is expanded by the 
chain rule 


V®S„ = sftp S a = N B aP si. (13.110) 

Finally, equals 7>®®n“. Putting these elements together yields the identity 

B®® = HB a pSgS$, + S a n a bw. (13.111) 

We are interested in one particular corollary of this rich and beautiful relationship. 
It states that B a pS^S^ is the normal component and n a b ®® are the tangential 
components of the vector curvature tensor B®®. Let us focus on the normal 
component, which can be isolated by dotting both sides of this identity with the 
normal N: 


N-B w = B afl SZS$ (13.112) 

The tensor N • B®® is called the principal curvature tensor. While vector 
curvature tensor B®® depends on the curve’s embedding in the ambient space, 
equation (13.112) tells us that its normal component N • B®® depends only on the 
surface curvature tensor and the manner of the embedding. 

As the next step, raise the index T and contract with O: 

N-B % = B aP S%S p * (13.113) 

By equation (10.31) interpreted in the current context, the right-hand side becomes 

N-B % = B“-B ap n a n p . (13.114) 

This equation leads to numerous geometric insights. The left-hand side can be 
rewritten as acN ■ P where k is the curvature and P is the principal normal. The 
right-hand side can be rewritten as (By S a p — B a p) n a tv' . We have 
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kN-P= (13.115) 

Let us limit our attention to curves where the principal normal P is aligned with 
the surface normal N, which would, in fact, make them equal. For such curves, 
equation (13.115) becomes 


K=^S a p-B afi )n a n< > , (13.116) 

which is a very interesting formula that states the curvature k , characteristic of 
the curve’s embedding in the larger Euclidean space, depends on the metric and 
curvature tensors of the surface and the orientation — given by n a — of the curve. 

Let us ask the following question which, in 1760, was asked by Euler: Among all 
possible embedded curves obtained by cutting the surface by an orthogonal plane, 
which curves have the highest and the lowest curvatures? Equation (13.116) holds 
the answer to this question. 

By choosing different planes, we are essentially choosing Therefore, alge- 
braically, the question becomes: what are the extreme values of 

(B^S a p- 5 a/ j) nV (13.117) 

subject to the condition that n a is unit length 

S a pn a n p = 1. (13.118) 

This is a fundamental question in linear algebra. It is well-known that the extremal 
values and directions are given by the eigenvalue problem 

(B y y S ap - B aP ) n p = kSapnt. (13.119) 

Juggling the indices yields 

(Bffi - B%) n p = Xn a . (13.120) 

Therefore, the problem is reduced to finding the eigenvalues and eigenvectors of the 
matrix 


B r y S a p - Bp. (13.121) 

The simple argument we are about to make is best seen in plane matrix notation. 
Denote the tensor Bp by the matrix B and let its eigenvalues be and /c 2 . Then the 
tensor By — Bp is represented by the matrix 


(*h + k 2 ) I - B , 


(13.122) 
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which has the same eigenvectors as B and the corresponding eigenvalues are 

(k\ + k 2 ) - K\ = k 2 (13.123) 

(k\ + k 2 ) - k 2 = K\ (13.124) 

Therefore, the extremal values of the quadratic form (13.117) are the invariant 
eigenvalues K\ and k 2 of the curvature tensor 7>". These eigenvalues are known 
as the principal cun’atures of the surface. Their sum equals the mean curvature: 

= *-| + k 2 (13.125) 

and their product equals Gaussian curvature: 

K = k x k 2 . (13.126) 

Furthermore, due to the symmetry of B a p the corresponding directions are 
orthogonal. 


13.10 Geodesics 

The subject of embedded curves usually includes a discussion of geodesics. Given 
two points A and B on the surface, the geodesic is the curve of least length 
connecting A and B. Geodesics satisfy a set of second-order equations known as 
the geodesic equations. These equations are usually derived [31] by following the 
standard Euler-Lagrange procedure. We pursue an entirely different approach based 
on the calculus of moving surface. We therefore postpone the subject of geodesics 
until Chap. 17. 


13.11 Summary 

In this chapter, we studied curves from two perspectives: as general submanifolds 
of codimension greater than one, and as submanifold of dimension one. The former 
perspective led to the concepts of the curvature normal tensor B'j’ which reduces to 
N B'f, for hypersurfaces. The latter perspective led to an equivalent concept of the 
principal normal P, the binormal Q as well as higher-order derivatives with respect 
to arc length s. 

Finally, we analyzed curves embedded in surfaces and recovered Euler’s defini- 
tions of principal curvature. 


Chapter 14 

Integration and Gauss’s Theorem 


14.1 Preview 

In this chapter, we pursue two goals. First, we discuss integration from a geometric 
point of view and establish the tensor calculus way of representing invariant 
integrals in arbitrary coordinates. Second, we prove a rather general form of Gauss’s 
theorem. The starting point for the derivation is Gauss’s theorem over flat domains 
referred to as Cartesian coordinates. Our task is to extend that result to arbitrary 
curved patches. 

This chapter first defines a single kind of invariant integral in which a physical 
quantity is integrated over a geometric domain. The domain may be a section of a 
curve, a surface patch, or a three-dimensional domain. In all cases, the integral is 
defined by a procedure in which the domain is broken up into ever smaller parts and 
the limit of a finite sum is considered. All types of integrals (work, flux, circulation, 
etc.) encountered in applications can be reduced to an invariant integral. 

Gauss’s theorem and Stokes’ theorem transform integrals over closed domains to 
integrals over the boundaries of those domains. Both theorems are multidimensional 
generalizations of the fundamental theorem of calculus. We demonstrate that the 
two theorems are closely related. The formulation of these theorems took several 
decades and efforts from some of the nineteenth-century’s brightest scientific minds, 
including Gauss, Ostrogradsky, Cauchy, Poisson, Maxwell, and Riemann [24]. 


14.2 Integrals in Applications 

In applications, integrals represent physical and geometric quantities. For example, 
the total mass M of a body 12 with density distribution p is given by the volume 
integral 
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M = I pdQ. (14.1) 

Jn 

The kinetic energy T of a fluid with density distribution p and velocity field V is 
given by the integral 



(14.2) 


The flux F of the fluid across a surface S is given by the surface integral 

F = J V-N dS. (14.3) 

The total charge Q distributed with density a over the surface S of a conductor is 
given by the surface integral 

Q = J adS. (14.4) 

The total force F of pressure exerted on a body £2 with boundary S immersed in 
fluid with pressure distribution P is given by the surface integral 

F = -/ s ™ dS. (14.5) 

The total curvature T of a surface S is given by 



(14.6) 


where K is the Gaussian curvature. 

The total gravitational energy V of a string U with linear mass density p 
suspended in a gravitational field with potential P is given by the contour integral 


V = f pP dU. 


(14.7) 


Ju 

Each of the integrals in these examples has the form 


L 


Geometric Domain 


Invariant. 


(14.8) 


The definition of the integral (14.8) involves a limiting process, in which the domain 
(whether it is £2, S, or U) is divided into small pieces and a finite sum approaches a 
limit. This procedure is nontrivial, but its independence from coordinates is evident. 


14.3 The Arithmetic Space 
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14.3 The Arithmetic Space 

The definition of the physical integral 

I FdQ, I FdS, I FdU (14.9) 

JQ. Js Ju 

is entirely coordinate free. However, in order to evaluate a physical integral, one 
almost always needs to introduce a coordinate system and transform the invariant 
integral into a repeated arithmetic integral. The transformation to a repeated integral 
consists of three steps. First, the physical field F is expressed as a function F (Z), 
F (S) or F ( U ) with respect to the coordinates. Second, the domain of integration 
is translated into the corresponding arithmetic domain. Third, an additional factor 
is introduced that adjusts for the length, area, or volume mismatch between the 
physical domain and its arithmetic representation. 

Consider an example. Let F be the scalar field defined as the distance squared 
from a point P . What is the integral of F over a unit disk Q. with a center that is a 
distance y/2 away from PI This problem is illustrated in Fig. 14.1. 

Can you solve this problem without introducing a coordinate system? Perhaps 
you could, but given the available power of coordinates, it is hardly worth the 
effort. Keep in mind that coordinate systems are a recent luxury. Ancient Greeks, 
in particular Archimedes of Syracuse whose portrait appears in Fig. 14.2, did not 
have analytical methods at their disposal, yet were remarkably good at evaluating 
areas and volumes. Famously, Archimedes discovered that the volume of a sphere 
is 4tt/3. The answer to the problem in Fig. 14.1 is 5tt/2, so perhaps Archimedes 
could solve it geometrically, as well. However, with the benefit of coordinates, one 
need not be Archimedes to solve this problem. 


Fig. 14.1 A unit disk £2 and 
a point P a distance from 
the center of the disk. The 
scalar field F is defined as the 
distance squared from the 
point P . The firgure includes 
six level lines of the function 
F for values betwen 1 /2 
and 11/2 
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Fig. 14.2 This 1620 work by 
the Italian Baroque painter 
Domenico Fetti 
(c. 1589-1623) depicts 
Archimedes toiling over a 
problem in natural science 
without the benefit of a 
coordinate system 



We ought to select a coordinate system that is well-suited for the particular 
problem at hand. Let us first consider Cartesian coordinates x,y with the origin 
at the center and coordinates (1, 1) at the point P . Then the domain f2 is expressed 
by the relatively simple integration and the integral is given by 


/ 

JO, 


FdQ 



l) 7 + {y — l) 7 ^ dxdy 


(14.10) 


If we shift the coordinate system to the point P , then the integrand simplifies, while 
the limits become more complicated 


f /*o r~ t+\/ 1 — (v+t) 2 

/ FdQ.= / / (x 2 + y 2 ) dxdy. 

Jn J - 2 i-i-Vi-(v+0 2 


(14.11) 


These integrals are not particularly easy to evaluate. That is because Cartesian 
coordinates are usually not the best choice when describing circular geometries. 

A significant simplification comes from using polar coordinates. If the origin of 
the polar coordinates r, 6 is at the center of the disk and the point P is at 0 = 0, 
then the domain Q corresponds to a simple rectangle (r, 6) e [0, 1] x [0, 2n] in the 
arithmetic space. The distance squared is given by the law of cosines 


F (r, 9) = 2 + r 2 — 2\[2r cos 9 


(14.12) 
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Fig. 14.3 With respect to the polar coordinate system, illustrated on the left, the domain £2 
corresponds to the arithmetic domain shown on the right 


and the invariant integral is given by repeated arithmetic integral 



(14.13) 


which is easy to evaluate. From multivariable calculus, the reader is familiar with 
the additional factor of r in the integrand. It is the adjustment for the distortion in 
area between the arithmetic and the geometric domains. Its interpretation from the 
tensor calculus point of view is discussed below. 

The polar coordinate system that we just considered is perhaps the best choice for 
the posed problem because it leads to the simplest repeated integral. Nevertheless, 
let us consider one more polar coordinate system. It is not as convenient for the 
problem we have just solved, but it is better suited for the illustration of the concept 
of the arithmetic space. Place the origin of the coordinate system at P and direct the 
polar axis in the direction tangent to the disk as illustrated on the left of Fig. 14.3. 

In this polar coordinate system, the field F has the simplest form: F (r, 9) = 
r 2 . The main challenge is to identify the limits of integration. It is evident that 6 
changes from 0 to jr/ 2. It is not hard to demonstrate that, for each 9, r varies from 
V2cos (f — 9) — V sin 29 to \/2cos (f — 9) + Vsin2 9. Therefore, the conversion 
to arithmetic integral is 



(14.14) 


The repeated integral on the right hand side of equation (14.14) is called arith- 
metic because it involves algebraic expressions rather than geometric quantities. It 
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can be evaluated by the analytical techniques that the reader learned while studying 
multivariable calculus. The immediate graphical interpretation of equation (14.14), 
illustrated on the right of Fig. 14.3, is Cartesian — thus the term arithmetic which is 
often used synonymously with R" . 

This Cartesian feature of the plot on the right may not seem particularly 
significant at this point. After all, we could have chosen a Cartesian coordinate 
system to begin with. Instead, we chose a curvilinear coordinate system with 
curved coordinate lines. Subsequently, the geometric interpretation of the arithmetic 
integral (14.14) “straightened out” the coordinate lines. Both the original domain C 
and the arithmetic domain are Euclidean, and the curvature of the coordinate lines is 
only due to our choice of coordinates. Looking ahead to studying curved surfaces, 
the Euclidean nature of the arithmetic domain and the associated Cartesian 
interpretation of the coordinates will take on much greater significance. 


14.4 The Invariant Arithmetic Form 


We now discuss the crucial factor of r in equations (14.13) and (14.14). Most 
calculus textbooks include an illustration of the fact that rdrdO represents the area 
of a small coordinate patch. Therefore, the factor of r can be thought of as the 
metric adjustment factor. The corresponding tensor object is the volume element 
x/Z introduced in Sect. 9.7, which may be referred to as the area element and length 
element depending on the context. Tensor calculus offers a simple and general recipe 
for expressing invariant integrals in arithmetic form. It is captured by the equation 

r rBi, 0B2 rB\ 

/ FdQ = / / / F (Z)VzdZ l dZ 2 dZ 3 , (14.15) 

«/ f 2 J A% J A 2 J 

where the integration limits satisfy A\ < B\, A 2 < Bi and A 3 < B3 and are chosen 
so that the integration domain is properly captured. 

We must demonstrate that the repeated arithmetic integral in equation (14.15) is 
invariant under a change of variables. We limit our demonstration to orientation- 
preserving coordinate changes. This orientation-preserving property of a coordinate 
change is given in Sect. 4. 10. The explanation is based on the corresponding theorem 
from multivariate calculus. According to the change-of-variables theorem, when 
going from one coordinate system (unprimed) to another (primed), an integral 
transforms according to the following rule 


n B3 /» Z ?2 P B\ n By p Bj' pB^r 

/ / / F(Z)dZ 1 dZ 2 dZ 3 = / / F(Z')\J\~ l dZ l 'dZ 2 'dzK 

J Aj, J A2 J A\ A-ij J A 2 / J A^r 

(14.16) 

In words, the integrand needs to be re-expressed in terms of the new (primed) 
coordinate systems and multiplied by the absolute value of the determinant of 
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the coordinate transformation and the limits of integration must be appropriately 
adjusted. Let us apply this rule to the integral in equation (14.15). Going from 
unprimed coordinates to the primed coordinates, we have 



F (Z)sfZdZ x dZ 2 dZ 3 



F (Z (Z')) \[ZJ~ x dZ x ' dZ 2 ' dZ 3 ' 


(14.17) 


Recall from Sect. 9.9 that the volume element \[Z is a relative invariant of weight 
— 1 , that is 


■v/Z 7 = J~ x jz. 


(14.18) 


Since F (Z') = F (Z (Z')), we have 



F (Z) s/ZdZ x dZ 2 dZ 3 


r B 3 ' r B i ’ r B \' 
J Ay J Ay J Ay 


F (Z') 4z’ dz v dz 2 ' dz 3 ' 
(14.19) 


which confirms the invariant nature of the integral in equation (14.15). 


14.5 Gauss’s Theorem 


Gauss ’s theorem is of fundamental importance in applied mathematics. After all, it 
is a multidimensional generalization of the fundamental theorem of calculus. For a 
domain G with boundary S, Gauss’s theorem reads 


[ = [ 

Jn Js 


Nj T'dS. 


(14.20) 


It holds for a sufficiently smooth tensor field T' . Importantly, Gauss’s theorem 
holds in Riemannian as well as Euclidean spaces. For instance, on a curved patch S 
bounded by a contour ( 7 , Gauss’s theorem for a tensor T a reads 

[ V a T a dS = f n a T a dU. (14.21) 

Js Ju 

The applications of this theorem are too numerous to even begin to mention. In this 
book, its most frequent application will be in the context of calculus of variations. 

Let us assume that Gauss’s theorem holds in the Euclidean space and prove 
its more general Riemannian form (14.21). Convert the volume integral to the 
arithmetic form 
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J V ; r'jQ = JJI V, T‘ \fZdZ [ dZ 2 dZ 3 


(14.22) 


and expand the covariant derivative according to its definition 

J V,T‘dn = JJJ {^jsfz + T\ k T k sfZ^ dZ x dZ 2 dZ 3 

Transform the first term in the integrand by the product rule 


(14.23) 


L v ' r ‘ m= fff 


i 


d[vVz 


dz 1 


V + Y\,T k s/z I dZ x dZ 2 dZ 3 


dZ' 


(14.24) 


Sincev 


9VZ 

dZ' 


VZT k 


(14.25) 


the last two terms cancel, leaving the single term 


L v ' r ‘ m= I.IJ 


d(T‘Vz) 

v ■ dZ 1 dZ 2 dZ 3 . 


dZ‘ 


(14.26) 


This term is subject to Gauss’s theorem in the arithmetic space. After all, the 
arithmetic space is a Euclidean space referred to as Cartesian coordinates. Thus, 


f VJ'dQ. = I ' NiVVzdS, 

Jn Js 


(14.27) 


where the bar above the letter indicates that the symbol refers to an object in the 
arithmetic space. Express the surface integral in this equation in arithmetic form 


[ V,rjQ= [ NjT’ \/~Z\f§dS l dS 2 . 

Jn Js 


(14.28) 


Let us determine the relationship between the arithmetic and the actual normals 
/V, and Nj . These normals are given by the expressions 

= hi jk e af, ZiZ k , and (14.29) 

Ni = \s ijk s afi ZiZ k . (14.30) 

Note that there are no bars over the shift tensors. The entries of the shift tensor 
are derived from the parametric equations that describe the boundary — and those are 
the same in the arithmetic and the actual spaces. Thus the two normals are related 
by the identity 
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a.dl = N .dl 

Vz Jz 


(14.31) 


where Z = 1 because the coordinates are Cartesian in the arithmetic space. 
Therefore, equation (14.27) can be rewritten as 


f v,rdn= [( 

Jn JJ 


NjT' VSdS l dS 2 


(14.32) 


which is, in arithmetic form, precisely the statement of Gauss’s theorem: 


[ ViT'dQ = [ 

Jn Js 


NiT’dS. 


(14.33) 


14.6 Several Applications of Gauss’s Theorem 

As the first striking application let us show that the integral of the normal over closed 
surface vanishes: 


L 


Nr/S = 0. 


(14.34) 


The integral of the normal may be interpreted as the total force on a body immersed 
in a uniform pressure field. The demonstration of equation (14.34) is entirely 
straightforward. Since the integral in question can be written as 


f N dS = [ 

Js Js 

an application of Gauss’s theorem yields 

[ Nr/S = [ 
Js Jn 


N'ZidS, 


V'Z, dQ. 


(14.35) 


(14.36) 


and the integrand vanishes due to the metrinilic property of the covariant derivative. 

As another even more striking and, perhaps, even more straightforward applica- 
tion, let us show that the integral of curvature normal over vanishes over a closed 
surface 


l 


NB“dS = 0. 


(14.37) 


This identity is a near triviality since the curvature normal equals the surface 
Laplacian of the position vector 
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N B“ = V a V“R. 


(14.38) 


Thus, by Gauss’s theorem, 

I V„V“R dS = I n a V a RdU= I ndU, (14.39) 

Js Ju Ju 

where U is the nonexistent boundary of the closed surface S . Thus, equation (14.37) 
holds. 

If the same integral is considered over a patch rather than a closed surface, then 
the result 


[ N B“dS = f n 
Js Ju 


dU 


(14.40) 


gives a vivid geometric interpretation of mean curvature: Mean curvature measures 
the degree to which the contour boundary normal field n points consistently out of 
the plane. Furthermore, the curvature normal is given by the limit 


f T ,ndU 

= (1441) 

where the integrals are calculated for a patch S that is appropriately shrinking to a 
point at which the normal and the mean curvature are evaluated. Dotting both sides 
with N gives an explicit limit for the mean curvature 


B“ = N • lim 

“ s-*o 


fu ndU 

Js d S 


(14.42) 


14.7 Stokes’ Theorem 

In order to demonstrate the Stokes’ theorem, introduce the unit tangent vector T. Its 
ambient components 7) are given by expression 

7} = e ijk N j n a z k a (14.43) 

Exercise 288. Show that the vector T with components given by (14.43) is indeed 
the unit tangent to the contour boundary. That is, T is unit length 

T ■ T = 1, (14.44) 


orthogonal to the surface normal N 
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T • N = 0, 


(14.45) 


and the contour normal n 


T • n = 0. (14.46) 

Now, onto the proof of the Stokes’ theorem. The contour integral is given by 

I F i T i dU= I F i s ijk N j n a Z k dU, (14.47) 

Ju Ju 

where the integral on the right is immediately subject to Gauss’s theorem: 

[ F'Tj dU = [ V" {F i E ijk N i Z k )dS (14.48) 

Ju Js 

Expand the integrand by the product rule 


L F ‘ Tidu =l (' 


[V*F i e ijk N J zZ + F i e ijk V*N j Z k + F* e ijk N j V* Z k )dS, 

(14.49) 

and apply the appropriate differential identity to each term 

I F'Ti dU= J (z“V m F 1 e ijk NJ Z k -F' B ljk Z J p B a P Z k + F< s ijk N^ N k B a a ) dS. 

(14.50) 

The second and the third terms vanish due to the skew-symmetry of the Levi-Civita 
symbol S;j k . Thus, we are left with a single term 


(14.51) 


[ F i T i dU= [ V n F i s ijk N j Z k ZyS. 

Ju Js 

By the projection formula (10.55), we have 

[ F’Tj dU = f V”F i s ijk N J 8 k m dS - V" F‘ e ijk N j N k N m dS. (14.52) 

Ju Js 

The term with the three normals vanishes due to the skew-symmetry of Si jk and the 
remaining term (with the indices rotated i — »• j — > k — > i) gives us precisely the 
statement of Stokes’ theorem 

[ F‘ Tj dU = f e ijk VF J N k dS, Q.E.D. (14.53) 

Ju Js 

Exercise 289. Explain why the term Sij k Z^Z k B a ^ vanishes. 


246 


14 Integration and Gauss’s Theorem 


14.8 Summary 

This chapter was devoted to integration. Integration proved to be another topic 
that is well organized by the tensor framework. We have highlighted that physical 
integrals are expressed in coordinates by equation (14.15). Furthermore, integrals 
with sufficiently smooth integrands are governed by Gauss’s theorem (14.21). 
Gauss’s theorem has numerous applications. In this book, Gauss’s theorem will 
find important applications in shape optimization and other problems with moving 
surfaces. 


Part III 

The Calculus of Moving Surfaces 



Chapter 15 

The Foundations of the Calculus 
of Moving Surfaces 


15.1 Preview 

Moving surfaces are ubiquitous in life and mathematics. Moving surfaces, like 
stationary surfaces, need their own language. We have shown that, in many ways, 
tensor calculus is an ideal language for describing stationary surfaces. For moving 
surfaces, the language of tensors must be extended to capture the particularities 
of moving surfaces. This extension comes in the form of the calculus of moving 
surfaces (CMS). 

The term moving surface likely invokes an image of dynamically deforming 
physical surfaces, such as waves in water, soap films, biological membranes, or a 
fluttering flag. However, in applications, moving surfaces arise in numerous other 
contexts. For example, in shape optimization — a branch of the calculus of variations 
where the unknown quantity is the shape of a domain — moving surfaces arise as 
a parametrized family of allowable variations. In shape perturbation theory — I 
think this term is self-descriptive enough — moving surfaces arise as evolutions from 
the unperturbed to the perturbed configurations. Finally, moving surfaces can be 
effectively introduced in problems where one may not think that moving surfaces 
can play any role at all. For example, in Chap. 17 we use the calculus of moving 
surfaces to prove a special case of the celebrated Gauss-Bonnet theorem which 
states that the integral of Gaussian curvature over a closed surface is independent of 
the shape of the surface and only depends on its genus (the number of topological 
holes). 

The fundamental ideas of the calculus of moving surfaces were introduced by the 
French mathematician Jacques Hadamard (Fig. 15.1) who studied the propagation 
of discontinuities in continuous media. The basic concepts were introduced in his 
Memoire sur le probleme d’ analyse relatif a Vequilibre des plaques elastiques 
encastrees [21]. Since Hadamard’s pioneering work, numerous contributions, large 
and small, were made by several distinguished applied mathematicians [16,45]. 
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Fig. 15.1 The great French 
mathematician Jacques 
Hadamard (1865-1963) is the 
originator of the calculus of 
moving surfaces 



15.2 The Kinematics of a Moving Surface 

A moving surface S (t) can be thought of as a family of surfaces parameterized 
by a time-like variable t . For simplicity, we refer to t simply as time, although in 
nonphysical scenarios t can be any parameter. We almost always drop the argument 
t and use the symbol S to denote a moving surface. 

A moving surface can be described parametrically by a set of equations 


Z i = Z i (t,S), (15.1) 

where S a are the coordinates to which the surface is referred at time t. As always, we 
suppress the tensor index of function arguments. The ambient coordinates Z' and 
the surface coordinates S a are arbitrary, subject to sufficient smoothness conditions 
discussed in the next paragraph. Equation (15.1) is analogous to equation (10.3) for 
a stationary surface. At any particular moment of time t, the surface is subject to 
classical differential geometry analysis described in the preceding chapters. 

We assume that surface coordinates evolve smoothly. Analytically, this means 
that Z' (t, S) is a smooth function of t . Intuitively, this implies that nearby points 
on nearby surfaces (i.e., surfaces corresponding to nearby times t and t + h) have 
nearby surface coordinates. This describes continuity while smoothness is stronger: 
our coordinates are not only evolving continuously, but differentiably to a sufficient 
number of orders. In particular, the trajectory Z' (t , S 0 ) of a point with fixed surface 
coordinates Sq is a smooth curve. 

Equation (15.1) describe a family of invariant surfaces for a particular com- 
bination of ambient and surface coordinates. In the spirit of tensor calculus, we 
must enable ourselves to parameterize this family of surfaces in a completely 
arbitrary way, only subject to the smoothness requirements just discussed. Consider 
an alternative family of parametrizations 
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Fig. 15.2 The same evolution of an ellipse, in which the horizontal semiaxis equals At, is 
parameterized in two alternative ways. The figure on the left shows parametrization (15.5), while 
the figure on the right shows parametrization (15.6). The dotted lines show the trajectories of 
coordinate particles , that is, points of constant surface coordinates 


Z'=Z'(t,S'), (15.2) 

where we have kept the ambient coordinates unchanged, but introduced new surface 
coordinates S a . What is the relationships between the old coordinates S a and the 
new coordinates S a ? The key point to realize is that the relationship between the 
two coordinate systems must depend on time: 

S a ' = S a ' (t, S) (15.3) 

S a = S a (t,S'). (15.4) 

In other words, equations (10.27) and (10.28) would no longer be sufficient. 
The calculus of moving surfaces, following in the footsteps of tensor calculus, is 
adamant about constructing an analytical framework that is completely independent 
of parameterization, leaving the freedom to choose a parametrization to the analyst. 

Consider the evolution of an ellipse in which one of the semiaxes is growing at a 
constant rate A. Figure 15.2 shows this evolution from t = 1 to t = 2. Perhaps the 
simplest way to parametrize this evolution is 


! x ( t , a) = At cosa 
y (f, a) = sin a 


(15.5) 


In this parameterization, the value of a does not correspond to the polar angle 0. 
The same evolution parameterized by 0 appears as 


f x (t, 6) = 

j.vM) = 


At cos 0 

\J cos 2 6-\-A 2 t 2 sin 2 6 
At sin 6 

\J cos 2 6+A 2 t 2 sin 2 6 


(15.6) 


The former parametrization is seen on the left of Fig. 15.2 and the latter on the right. 



252 


15 The Foundations of the Calculus of Moving Surfaces 


The relationship between 5 1 = a and 5 1 = 9 is, of course, time-dependent 

9 (t, a) = arccot (At cota) (15.7) 

and that is the key point. To reiterate: in order to assure the freedom to choose any 
parameterization for a moving surface, one must consider a time-dependent change 
of variables ( 15.3)— (15.4). If we were to restrict ourselves to time-independent 
changes of parametrization, then there would be no need for the new derivative 
operator V, which is at the heart of the calculus of moving surfaces. 

It is the inescapable time dependence in the relationships (15.3) and (15.4) that 
poses the challenge that the invariant time derivative V seeks to overcome. The 
crux of the problem is that the partial derivative 3/3 1 of a invariant T (t , 5 ) is not 
itself an invariant. We demonstrate this in the following paragraph. So our starting 
point for moving surfaces is more complicated than it was for stationary surfaces 
where the partial derivative 3/3 5“ produced a covariant tensor out of an invariant. 
Of course, matters become even more complicated when the partial time derivative 
3/3 1 is applied to a tensor Tjfi. As you would expect, the result 37" 'j“ ( t , 5 ) /dt is 
not a tensor. 

We now show that the partial derivative d/dt does not preserve the invariant 
property. Suppose that T is an invariant defined on a moving surface, such as the 
mean curvature B" or the Gaussian curvature K. Let U be the partial derivative of 
T with respect to time 


3 T(t,S) 


(15.8) 


and evaluate U according to the same rule in the alternative coordinate system S a ' 


U' = 


dT (t, S') 
3 ~t 


(15.9) 


In order to establish the relationship between U and U\ differentiate the identity 


T(t,S') = T(t,S(t,S')) 


with respect to t . By the chain rule, we find 

/ 3 T(t,S) 37" 35“ (t. S') 

dt 3 S“ 3 1 

Introduce the Jacobian-like object /" defined as 

35“ (t, S') 


(15.10) 


Z“(L5') 


dt 


(15.11) 


15.3 The Coordinate Velocity V 


253 


The definition of the Jacobians and /“ is the same as before except now these 
objects are functions of t : 


■CM') 

Ja (t'S) 


35“ (t, S') 
35“' 

35“' (t, 5) 
35" 


(15.12) 

(15.13) 


Using the newly defined object equation (15.10) can be rewritten as 


U = U + /"v a r. 


(15.14) 


This identity indicates that U is not an invariant; after all, U' / U because of the 
term /"V a 7\ Thus, the tensor property is lost even for invariants. This is not the 
case for the partial derivatives 3/3 Z' and 3/35“, which produced covariant tensors 
out of invariants. In this chapter, we construct a new differential operator V that 
preserves the invariant property. In the next chapter, the new operator is extended to 
arbitrary tensors. 

Exercise 291. Calculate the Jacobians /" and jf for the change of variables in 
equation (15.7). 


15.3 The Coordinate Velocity V 1 


In this section we introduce the coordinate velocity V . It is an essential ingredient in 
defining the invariant velocity C , which is the fundamental quantity in the calculus 
of moving surfaces. The coordinate velocity V is defined as 


3Z' (t, 5) 
ft 


(15.15) 


It is the ambient component of the velocity of the coordinate particle with fixed 
coordinates 5". To show this, note that the position vector R tracking the coordinate 
particle 5“ is given by 


R(f,5 0 ) = R(Z(f,5 0 )). 


(15.16) 


Differentiating, we find 


3R (f, 5 0 ) _ 3R 3Z' (f,5 0 ) 
3 Z 7 ft 


3 1 


(15.17) 
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or 


V = V'Z i, 


(15.18) 


confirming that V 1 is the ambient coordinate of velocity. 

The projection of the coordinate velocity onto the surface is called the tangential 
coordinate velocity V a 


v a = V 1 Z". 


(15.19) 


Exercise 292. Compute V for the parametrization (15.5) of the ellipse evolution. 

Exercise 293. Compute V for the parametrization (15.6) of the ellipse evolution. 

Exercise 294. Note that the answers obtained in the two preceding exercises are 
different, indicating that V is not a tensor with respect to changes in surface 
coordinates. 

Exercise 295. Compute V a for the parametrization (15.5) of the ellipse evolution. 

Exercise 296. Compute V a ' for the parametrization (15.6) of the ellipse evolution. 

Exercise 297. Are the two answers the same? What does that tell you about the 
tensor nature of V a with respect to surface coordinate changes? 

Exercise 298. Show that V is tensor with respect to changes in ambient coordi- 
nates. 

Exercise 299. From the preceding exercise and equation (15.19), conclude that V a 
is also a tensor with respect to changes in ambient coordinates. 

Exercise 300. Show that the acceleration of a coordinate particle is given by 



(15.20) 


The preceding exercises dealt with one particular example of a change of surface 
coordinates. Let us now derive the general rules by which V and V a transform 
under a simultaneous change of ambient and surface coordinates. In the primed 


coordinates, V is given by 


3Z''(lS') 

dt 


(15.21) 


To relate V' and V 1 ' , substitute the relationships between the coordinate system into 
equation (15.21) 
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Expanding the right-hand side, we find by a repeated application of the chain rule: 

T ,i'_ a#' (dz*(t,s) , dz i (t,s l )ds a (t,s l )\ 

3Z' l 3/ + 3 t )' 

In other words, 

V 1 ' = V' jf + Z‘ a jf Jf . (15.24) 

Thus, we officially conclude that V is not a tensor with respect to changes in surface 
coordinates. Note two interesting aspects of equation (15.24). First, if we were to 
limit ourselves to time-independent changes of surface coordinates, then — since /“ 
would vanish — V would be tensor. Second, the nontensor part of the relationship 
in equation (15.24) is proportional to the shift tensor Z' a . Thus, the projection V Nj 
has a legitimate chance at being an invariant. It is, in fact, an invariant and is known 
as the surface velocity C and is introduced in the next section. 

Exercise 301. Show that coordinate velocities V and V associated with 
parametrizations (15.5) and (15.6) satisfy equation (15.24). 

Exercise 302. Show that V a transforms according to 

V a ' = V a Jf' + (15.25) 

Exercise 303. Show that coordinate velocities V a and V 01 ' associated with 
parametrizations (15.5) and (15.6) satisfy equation (15.25). 


15.4 The Velocity C of an Interface 

In the preceding section, we established that the coordinate velocity V in the a 
tensor on the account of the term Z' a J! Jf in equation (15.24). This term contains 
Z' a and is therefore orthogonal to the normal /V,- (more accurately: \',>). To take 
advantage of this, multiply both sides by Ni' 

V'' Nf = V' Jf' Ni/ + Z' a jf Ni'Jf . (15.26) 

Since J- = N the first term on the right-hand side equals NjV , while the 
second term vanishes. Thus, 


V i 'N i r = V i N i . (15.27) 

In other words, the normal component V N, of the coordinate velocity is invariant. 
This combination is called the velocity of the surface and is denoted by the letter C 
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Fig. 15.3 Geometric 
construction of the surface 
velocity C 



N 


St St+h 


C = V'Nj. 


(15.28) 


Note that the sign of C depends on the choice of the normal. 

The velocity C has a clear geometric interpretation illustrated in Fig. 15.3. 
Denote the surfaces at two nearby moments of time t and t + h by S t and ,S, + /, . 
Suppose that A is a point on S, and B is a point on .S’, that has the same surface 
coordinates as A. Thus, the vector from A to B is approximately V h. Let P be the 
point where the unit normal N of S r intersects S t +h - Since h is small, the angle APB 
is nearly 7t/2. The length of A P is approximately 


\AP | = V-N/i = Ch. 


(15.29) 


Therefore, C can be defined as the limit 



(15.30) 


and is interpreted as the instantaneous velocity of the interface in the normal 
direction. 

We have noted previously that the sign of C depends on the choice of the 
normal. This property of C is also evident in its geometric interpretation: the length 
\AP\ is considered positive if the vector A P points in the same direction as N 
and negative otherwise. This convention can be explicitly incorporated into the 
geometric definition as follows 



(15.31) 
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The geometric approach makes no reference to coordinate systems, and it 
is therefore clear that C is an invariant quantity. Furthermore, the geometric 
construction of C is consistent with the analytical definition, since for “small 
enough” h, AP is “nearly” an orthogonal projection of A B . There is no doubt that 
this argument can be turned into a rigorous proof of the equivalence. We have not 
presented a rigorous demonstration of the equivalence and therefore invite the reader 
to view equation (15.28) as the definition of C and equation (15.31) as an intuitive 
geometric insight. 

Although the quantity C is a scalar, it is called a velocity because the normal 
direction is implied. One may define the vector normal velocity C: 

C = CN. (15.32) 

The vector velocity C is convenient in several contexts, particularly equation 
(15.46). However, all the information regarding the motion of the surface is 
contained in the scalar quantity C . 

Exercise 304. Compute C for A = 1 at t = 1 for the evolution (15.5). 

Exercise 305. Compute C for A = 1 at t = 1 for the evolution (15.6). 


15.5 The Invariant Time Derivative V 

The covariant derivative V, was born out of the need to preserve the tensor property 
of objects, since the partial derivative 3/3 Z' did not have this property. The 
covariant surface derivative V„ had similar motivations. In the study of moving 
surfaces, the partial derivative 3/3 1 does not preserve the tensor property, and 
therefore a new operator is needed. 

When we defined the invariant surface velocity C, our initial approach was 
analytical and we later backed it up with geometric intuition. In the case of the 
invariant derivative operator V, we start with the geometric construction, which is 
analogous to that of C . It is illustrated in Fig. 15.4. 

Suppose that an invariant field T is defined on the surface at all times t. The 
field T could be the normal N, the mean curvature B“, a surface restriction of 
an invariant field defined in the ambient space, or anything else. The idea of the 
derivative operator V is to capture the rate of change in T in the normal direction. 
This is similar to how C measures the rate of deformation in the normal direction. 
Given a point A on 5,, find the points B and P as we did previously: B lies on S t +h 
and has the same coordinates as A, and P is the intersection of S t +h and the straight 
line orthogonal to 5/,. The geometric definition of VT involves only points A and 
P and corresponds to our intuition of instantaneous rate of change in the normal 
direction: 
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Fig. 15.4 Geometric 
construction of the invariant 
time derivative V as applied 
to invariant quantities 



vr = lim 

h->-0 


T (P) — T (A) 

h 


(15.33) 


The definition (15.33) is entirely geometric and therefore VT is an invariant. 
Our next goal is to find an analytical expression for V T in a particular coordinate 
system. By virtue of the geometric construction, the resulting expression is invariant. 
The desired expression can be determined by analyzing the value of T at point B . 
Since B has the same surface coordinates as A, we have 

dT (t S') 

T(B)^T(A) + h (15.34) 

at 

On the other hand, T ( B ) can be related to T ( P ) because B and P are nearby 
points on the surface S t +h- The relation reads 

T (B) ^ T (P) + hV a y a T, (15.35) 


since V a T captures the rates of change in T along the surface and hV a captures 
the directed distance from B to P . Eliminating T (B) from equations (15.34) and 
(15.35), we find 


T (P) - T(A ) « h 



E“v„r 


(15.36) 


Motivated by equation (15.36) we give the following definition of the invariant time 
derivative V when applied to invariants: 


VT 


3 T(t,S) 
dt 


V a V a T. 


(15.37) 


As before, we have not drawn a rigorous connection between the definitions (15.33) 
and (15.37). Therefore, we consider equation (15.37) as the primary definition and 
equation (15.33) as the geometric intuition behind the analytical definition. 
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Exercise 306. With the help of equations (15.25) and (15.14), show analytically 
that V T is an invariant. 

Exercise 307. Show that V satisfies the sum rule 


V(E + G) = VF + VG. (15.38) 

Exercise 308. Show that V satisfies the product rule 

V (FG) = VE G + F VG. (15.39) 

Exercise 309. Show that V satisfies the product rule with respect to the dot 

V (F ■ G) = VF • G + F • VG. (15.40) 

Exercise 310. Show that V applied to a constant field T (t.S) = A vanishes. 

Definition (15.37) applies only variants of order zero. In Chap. 16, we expand V 
to variants of arbitrary order. So far, we are able to apply it to the position vector R, 
the normal N and the mean curvature B" . We would now like to present the results 
of applying V to these objects. However, in the case of the normal N and the mean 
curvature B' ( ' the derivation is postponed until Chap. 16. 

By definition, VR is given by 


9R(t,S) 

VR = - - K V a R. (15.41) 

dt 

The first term on the right-hand side can be expanded by the chain rule 

9R (f, 5) _ 3R(Z (t, S)) _ 9R dZ'(t,S) _ 

dt dt d Z> dt { ' ’ 

as we saw previously. Thus, 


VR = V i Z, - V a S a , (15.43) 

or 

VR = (V‘ — V a Z i a ) Z,-. (15.44) 


Finally, combining equations (15.19) and (10.55), we find 

VR = V j NjN'Zi, 


(15.45) 
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or 


VR = CN = C. (15.46) 

This is the first equation from the fundamental differentiation table of the calculus 
of moving surfaces. 

Exercise 311. Derive equation (15.46) by the geometric definition that (15.33). 

Exercise 312. Fill in the details of the transition from equation (15.43) to (15.44). 

Exercise 313. Fill in the details of the transition from equation (15.44) to (15.45). 

As was mentioned earlier, we postpone the derivation of the formulas for VN and 
V B" until the following chapter. However, one relationship involving VN is easy to 
determine. Differentiating the identity N • N = 1 , we find by the product rule that 

N • VN = 0. (15.47) 

This relationship tells us that VN is orthogonal to the normal space. It must therefore 
lie in the tangent space and be represented by a linear combination of the covariant 
basis vectors Sq.. The coefficients of the linear combination turn out to be — V“C. 
The result is known as the Thomas formula 

VN = -S„V"C. (15.48) 

This formula first appeared in Tracy Thomas’s book [46]. Finally, we mention that 
the derivative V B“ is given by the following formula, perhaps the most beautiful 
among all calculus of moving surfaces relationships: 

V5“ = V«V"C + CB“Bt . (15.49) 

This formula can be found in [ 16]. 


15.6 The Chain Rule 

The rule discussed in this section is analogous to the chain rule for the covariant 
surface derivative V„ discussed in Sect. 11.8. Suppose that a field T is defined in 
the ambient space. Its surface restriction to the surface S is subject to the invariant 
derivative V. By definition. 


VT 


3 T(t,S) 


F“V„T. 


3 1 


(15.50) 
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If T is a surface restriction of an ambient field T (t, Z). then T (t, S) may be 
expressed as T (t , Z (f , S)). By the chain rule of multivariable calculus and tensor 
calculus on stationary manifolds, we find: 


dT (t, Z ) 3 T 3Z ! (t, S ) 

Ft + FzJ Ft 


-V a Z t a V i T. (15.51) 


Since dT/dZ' = V, T and 8Z‘ (t, S ) /dt = V‘, 

VT = ( V ‘ - F“Z ;) V, T. 


(15.52) 


As we showed above while deriving VR. the quantity in parentheses equals CN‘ . 
We have thus arrived at the chain rule of the calculus of moving surfaces: 



(15.53) 


This chain rule has a clear geometric interpretation illustrated in Fig. 15.3. It 
states that the difference in the quantity T between the points A and P is due to two 
influences: the difference of h DT /dt in the value of T due to the passage of time 
and the difference of hC N‘ V, T between the values of T at the points A and P at 
time t . 


15.7 Time Evolution of Integrals 

The remarkable usefulness of the calculus of moving surfaces becomes evident 
from two upcoming formulas (15.54) and (15.56) that govern the rates of change 
of volume and surface integrals due to the deformation of the domain. Many critical 
quantities in physics and engineering are represented by integrals. These quantities 
include mass, energy, entropy, electrical current, and so forth. In applications one 
often needs to find the rate of change in these quantities induced by changes in 
the domain. In fact, most problems in the calculus of moving surfaces — whether 
in shape optimization, boundary variations or physical applications — begin with an 
integral expression. Thus, equations (15.54) and (15.56) are indispensable tools in 
the analysis of moving surfaces. 

Consider a Euclidean domain £2 with boundary S evolving with surface velocity 
C . Figure 15.5 illustrates this discussion. Suppose that the invariant field, vector, or 
scalar, F (t . Z) depends on time and is sufficiently smooth within £2. Then the rate 
of change of the volume integral f n Fd £2 is given by the formula 



(15.54) 
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Fig. 15.5 Illustrations of the integral laws (15.54) and (15.56). The figure on the left shows the 
annexed region that has a thickness proportional to C . The thicker portion of the boundary of the 
figure on the right illustrates that the change in area is proportional to the mean curvature 


The volume term on the right captures the rate of change of F and the surface term 
captures the rate at which the moving boundary annexes or gives up the neighboring 
regions. The annexing term is illustrated on the left of Fig. 15.5. 

We give this law without a general proof, but I believe that this law is entirely 
intuitive. Notice the similarities between this equation and the fundamental theorem 
of calculus in the case when the integrand and integration domain depend on the 
parameter t : 

d r h(t) r b{t) a f (t x) 

— / F(t,x)dx = I — - — dx + b' (t) F (t,b (?)) . (15.55) 

dt J a J a dt 

In fact, as an exercise below shows, a special case of equation (15.54) follows easily 
from the fundamental theorem of calculus. 

The other fundamental formula addresses the surface integral f s FdS for an 
invariant, vector or scalar, defined on the closed surface. The rate of change of this 
integral is governed by the formula 



(15.56) 


This formula was first given in [16]. The first term once again corresponds to the 
rate of change in F. Notably, the operator that figures in this term is the invariant 


time derivative V ! It could hardly have been any other operator: the left-hand side 


of equation (15.56) is coordinate-free, and therefore the result ought to consist only 
of invariant elements. 

In Chap. 14, we proved that the surface integral of the unit normal over a closed 
surface vanishes 



(15.57) 
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As an example of using equation (15.56), we now show a special case of this 
equation by means of the calculus of moving surfaces. Namely, we show that the 
rate of change of this integral is zero 


/ N dS = 0. 

dt J s 


By equation (15.56), we have 


J N dS = J VN dS - Js CB“NdS. 


According to the Thomas formula (15.48) 


4- / NrfS 

at 


f N dS = - f S a V a CdS - f CB“NdS. 

J S J S J s 


By Gauss’s theorem, 


d 

dt j s 


f N dS = f V"S„ C dS - f CB^NdS. 
Js Js Js 


(15.58) 


(15.59) 


(15.60) 


(15.61) 


Because V“S„ = N /?“, the right-hand side vanishes and equation (15.58) is 
confirmed. Since the integral of the unit normal over a sphere vanishes, we have 
actually shown that it vanishes over all smooth surfaces of genus zero — that is, all 
closed surfaces that can be obtained by a smooth deformation of a sphere. 

Exercise 314. Show that 


4 - f N B a a dS = 0. 
dt Js 

You will enjoy this exercise as it draws upon virtually all of the surface identities, 
including the Codazzi equation. 

Exercise 315. Conclude that the integral 

I N Bys (15.62) 

Js 

vanishes over all closed smooth surfaces of genus zero. 

Exercise 316. This exercise gives the proof of a special case of the volume law 
(15.54). Suppose that, with a Cartesian coordinate system imposed, the planar 
domain £2 is the region between two fixed values of x in the horizontal direction, 
and between the x-axis and a positive time-dependent function b (t,x) in the vertical 
direction. Consider a time-independent function F (x, y) and its area integral 
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P I'M rb(t,x) 

1=1 FdQ. = / / F(x,y)dydx. (15.63) 

J £2 J A\ J 0 

Use the fundamental theorem of calculus to show dl/dt is given by formula 
(15.54). 


15.8 A Need for Further Development 


The definition (15.37) gives us an operator that produces invariants when applied to 
invariants. However, for tensors of order greater than zero, the result of applying V 
is not a tensor. The following exercises show this. Therefore, more work remains to 
be done. The important task of constructing such an operator is undertaken in the 
next chapter. 

Exercise 317. For a tensor T ‘ , derive the following transformation rule for the 
variant U' = 3 T' ( t , S) /dt: 


‘ )T i 

u " = uij < + + rj ‘ jVi + rj d z - j ‘- 


(15.64) 


Exercise 318. For a tensor 7) , derive the following transformation rule for the 
variant U, = 37) ( t , S ) /dt: 


u i ' = Ui4 + + Ti4 y v j ’ 


(15.65) 


Exercise 319. For a tensor T a , derive the following transformation rule for the 
variant U a = 3 T a (t,S)/dt: 


jja f TTU ra' . toi' , ra' , 7 -<a ra' tP 

U ~ U J a + Ysfi Ja + 1 Jat + 1 ■ 


(15.66) 


Exercise 320. For a tensor T a r , derive the following transformation rule for the 
variant U a / = dT a > ( t , S) /dt : 


u a , = u a r a , + + T a C t . 


(15.67) 


Exercise 321. For a tensor T l , derive the following transformation rule for the 
variant W 1 = dT‘ (t, S) /dt - U“V„r'': 


W’- =w i j; - r^v z/j* j[ +r jW + r 


(15.68) 
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Exercise 322. For a tensor 7) , derive the following transformation rule for the 
variant W, = 97) (/, S ) /dt - F“V a r,: 


Wi' = Wj J\, + T k n Zi a T k J? 4 + T i j\, j ,V i - J a a JfVa'Ti'. 


(15.69) 


Exercise 323. For a tensor T a , derive the following transformation rule for the 
variant W a = d T a (; t , S ) /dt - V^pT 01 : 


w a ' = w a jf - njyjfjs' + T°J? t + T a J a 'jf. 


(15.70) 


Exercise 324. For a tensor T a ' , derive the following transformation rule for the 
variant W a > = dT a / (t, S) /dt — V^Vp'T a r. 


w* = w a j$ + r Y afi T y j t p c + T a j«, t . 


(15.71) 


15.9 Summary 

In this chapter, we introduced the fundamental elements of the calculus of moving 
surfaces. We defined the velocity of the surface C and a new time derivative operator 
V that preserves the invariant property. The two fundamental formulas (15.54) and 
(15.56) for differentiating integrals over deforming domains showed the importance 
of developing the techniques of the calculus of moving surfaces. 

The elements of the calculus of moving surfaces presented in this chapter are 
only the beginning. The technique is not fully developed until the time derivative V 
is extended to variants of arbitrary indicial signature. This is the central task of the 
next chapter. 


Chapter 16 

Extension to Arbitrary Tensors 


16.1 Preview 

In Chap. 15, the invariant derivative operator V was defined for variants of order 
zero. The new operator proved to have a number of essential features including the 
tensor property — that is, producing tensor outputs for tensor inputs. Even with that 
narrow definition, the new operator demonstrated its impressive utility in equations 
(15.54) and (15.56) for evaluating the rates of change of volume and surface 
integrals. However, analysis of all but a few problems is impossible, unless the new 
derivative is extended to all objects encountered on surfaces which includes variants 
with arbitrary indicial signatures. The development of this extension is the subject 
of this chapter. 


16.2 The Extension to Ambient Indices 

The extension of V to variants with ambient indices can be induced if V is to satisfy 
the following properties, the sum and product rules, commutativity with contraction, 
and the metrinilic property with respect to the ambient basis Z, . 

Consider the variant T of order zero 

T=rZ;. ( 16 . 1 ) 

If the desired properties hold, then 

VT=Vr'Z / . (16.2) 

Exercise 325. Explain how each of the three desired properties contribute to the 
validity of equation (16.2). 
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On the left-hand side, the invariant derivative V is applied to a variant of order 
zero and is therefore given by 


VT = — - E“V a T. 
dt 

Substitute equation (16.1) and apply the product rule: 


n r J~'i dZ 

vt = -4—z t + r - v y v y r z,-. 

at at 


Exercise 326. Show that 


3Z; 


r ijV J Z k . 


(16.3) 


(16.4) 


(16.5) 


Therefore, 


3 T l 

VT = — Zi + T k rLv j Zi - rvj'z (16.6) 

ot J 

Equating this expression to the right-hand side of equation (16.2), we arrive at a 
reasonable definition of V for a variant T l with a contravariant ambient index: 

37* 

vr = — - V y W y r + V j T) k T k . (16.7) 

Exercise 327. Show that similar reasoning leads to the following definition for a 
tensor with a covariant ambient index 


V7) = — i - V r V y T t - V J V k n T k . (16.8) 

at J 

Motivated by equations (16.7) and (16.8) we give the following definition for a 
tensor Tj with a representative collection of ambient indices: 

3 77 

vr; = -^-~ v Y v y T' + v k T' km r? - v k v^r m . (16.9) 

Algorithmically, equation (16.9) is interpreted similarly to other definition of tensor 
differential operators. The term — V Y V y appears only once, and to each index there 
corresponds a term containing the Christoffel symbols. The new extended operator 
retains all of its properties, and it is left as exercises to demonstrate that it is so. 

Exercise 328. Show that VT' is a tensor for a tensor argument T' . You should 
show this in two ways: by deriving the rule by which V T‘ transforms under a change 
of variables and by analyzing equation (16.3). 
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Exercise 329. Similarly, show that V 7) is a tensor for a tensor argument 7} . 

Exercise 330. Show that V T‘ is a tensor for a tensor operand T‘- . Once again, there 
are two ways of showing this: by deriving the rule by which VTj transforms under a 
change of variables, and by leveraging the corresponding property for V T' or V 7] . 

Exercise 331. Conclude that V possesses the tensor property for arbitrary tensors 
with ambient indices. 

Exercise 332. Show that V commutes with contraction. 

Exercise 333. Show that V satisfies the sum and product rules. 

Exercise 334. Show that if T l - (t.Z) is defined in the entire ambient space, then 
the invariant derivative VT’j applied to its surface restriction T'- ( t,S ) satisfies the 
chain rule 

3T (t 7^) 

vr; = — - + CN k v k T). (16.10) 

1 at 1 

Exercise 335. Conclude that the invariant time derivative is metrinilic with respect 
to the ambient metrics: 

VZ i, V Z'=0 (16.11) 

VZ y , VS'., VZ y , Vfiyjt, S7s ijk = 0. (16.12) 


16.3 The Extension to Surface Indices 

Historically, the extension of the invariant time derivative to ambient indices 
followed a long and winding path. Several incorrect attempts were made until a 
proper definition was finally given in [16], where it referred to as the 8/ 8t -derivative. 
The operator V, as applied to variants with surface indices, was first defined in [20] 
where its advantages over the 8 / 8 1 -derivative was described. 

The construction of the operator V for the ambient indices can proceed in a way 
similar to the ambient indices. We stipulate that V satisfies the following properties: 
the sum and product rules, commutativity with contraction and, when applied to S„, 
the result proportional to the normal N: 

VS„ = A a N. (16.13) 

Consider the following variant T of order zero: 


T = T a S„. 


(16.14) 
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If the desired properties hold, then 

VT = VT“ S a + 7’“y4 a N. (16.15) 

The left-hand side can be analyzed by the definition of V as applied to invariants: 

3T 


VT 


dt 


V^V«T. 


(16.16) 


Substituting equation (16.14), we find 

d (T 01 S 1 

VT = — — — — — (T a S a ) , (16.17) 

at 

which, by the product rule, equation (11.16), and renaming a fS in the second 
term, yields 


O r T 1 Ct 3S 

VT = —S a + - V^pT a S a - V p T a KB Pa . 

Exercise 336. Show that 


(16.18) 


3S^ 

dt 


V/jV. 


(16.19) 


Exercise 337. Thus, 


9S 

-g^ = V^ (V a S a + CN) . (16.20) 

Exercise 338. Expand the right-hand side and show that 

= v p v a S a + K“N B pa + N VftC - CBpS a . (16.21) 

Exercise 339. Show that substituting this result into equation (16.18), yields 
dT a 

VT = — s„ + T>v f v s„ + rv„CN - t>cb; s . - s„. , 16 . 22 , 


Comparing this result to equation (16.17) yields the definition of V for T a 
3 T a 


VT" = 


dt 


- V p VpT a + (VpV 01 -CBfj T p , (16.23) 
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as well as the expected formula for VS a : 

VS„ = NV a C. (16.24) 

Exercise 340. Show that the same approach suggests the following definition for 
covariant arguments: 

vr„ = ^ - V^T. - (V a E^ - CB£) T P . (16.25) 


16.4 The General Invariant Derivative V 

We are now ready to give the full definition of V for arbitrary tensors. We combine 
equations (16.7), (16.8), (16.23), and (16.25) for each type of index and give the 
following definition for the operator V for a variant T 1 .^ with a representative 
collection of indices. Define the Christoffel symbol T^ for moving surfaces as 

f“ = SJpV a - CBp. (16.26) 

Then the definition for the operator V as applied to 77? is: 


VT‘“ — 
W 1 jf i - 


°jP 

dt 


-V y S7 y T$ + V k V 


’j-'tna i/k 

km ' 


pm '-ri 
1 kj 1 m 


’ I rffTltil pG 

! + ft 


T)%. (16.27) 


The proofs of these properties of V are left as exercises. 

1 . The operator V produces tensor outputs for tensor inputs 

2. The operator V commutes with contraction 

3. The operator V satisfies the sum and product rules 

4. The operator V satisfies the chain rule. That is, if T‘- (t, Z ) is defined in the entire 
ambient space, then V applied to its surface restriction T‘ ( t,S ) satisfies the 
chain rule 


. . = dT (f, Z) + CN ky kT i 

J at J 

5. The operator V is metrinilic with respect to the ambient metrics 

VZ,, VZ' = 0 

VZ y , VZ' 7 , VS}, Veyjfc, Ve ijk = 0. 


(16.28) 


(16.29) 

(16.30) 
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6. The invariant derivative V does not commute with the surface derivative Y„ . The 
general rule for the commutator V V„ — V„ V depends on the indicial signature 
of the variant to which it is applied. For variants T of order zero, the rule reads 

(VV H - V«V) T = -CB£V m T. (16.31) 

Next, we evaluate the invariant time derivative for the fundamental differential 
objects on moving surfaces. 


16.5 The Formula for VS a 

By the definition (16.27), we have 

WA - (V«F“ - CB“) S a . (16.32) 

By equation (11.16), this becomes 

vs„ = V a V - rN5„ - (V«F“ - CB“) S u . (16.33) 

Decompose Y in tangential and normal components: 

VS„ = V a (F“S W + CN) - V y NB ya - {W a V w - CB“) S M . (16.34) 

Apply the product rule to the first term. Several terms cancel and we find the final 
formula 


VS« = NV a C. (16.35) 

The metrinilic property of V with respect to Z, yields the following component form 
of equation (16.35) 


VZ' = N‘V a C. (16.36) 

This completes perhaps the most challenging derivation of this chapter. The rest 
of the fundamental differential relationships follow relatively easily from equations 
(16.35) and (16.36). 
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16.6 The Metrinilic Property of V 

The metrinilic property of the invariant derivative V is similar to that of the covariant 
surface derivative V a . That is, the metrinilic property holds when applied to ambient 
and surface metrics with the exception of S„ and S" . The metrinilic property with 
respect to the ambient metrics follows from the chain rule and has been established 
above. In this section, we focus on the metrinilic property with respect to the surface 
metrics. 

For the covariant basis S a p, we have 

<7S a p = V (S„ • S„) = VS„ • + S„ • VS^. (16.37) 

By equation (16.35), we find 

vs a p = m a c -Sp + s a -mpC. (16.38) 

Therefore, the result is zero by orthogonality of N and S„ : 

VS aP = 0. (16.39) 

To prove that V<5^ vanishes, we appeal to the definition (16.27): 

95“ 

+ r a j; - rpi (16.40) 

Each of the first two terms vanish, while the third and the fourth terms cancel each 

other. Therefore 


V<5^ = 0. (16.41) 

To show that VS“^ vanishes, apply V to the identity S a ^Sp y = 8". By equation 
(16.41) 


V (, S aP Sp y ) = 0. (16.42) 

By the product rule, 

VS aft Spy + S a P VSpy = 0, (16.43) 


and by equation (16.39), 


VS H/3 Spy = 0 


(16.44) 
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which immediately yields 


<7S aP = 0. (16.45) 

The metrinilic property of the V with respect to the surface bases S u p and S 01 ^ 
allows the juggling of indices under V. For example, by raising the covariant index 
in equation (16.35), we find 


VS“ = NV“C. 

Similarly, by raising the covariant index in equation (16.36), we find 

VZ'“ = A FV“C. 


Exercise 341. Show from equation (16.39) that 
dS a g 

= V a Vp + VpVa-2 CB a p 

and from equation (16.45) that 


dS “P 

~dF 


-YxyP - V^F" + ICB 01 ^ . 


Exercise 342. Show that 


— = 25 (V„F“ - CB a a ) 

and therefore 

^ = VS (V„F“ - CB a a ) . 


and 



1 

Vs 


(V„F“ - CB a a ) . 


Exercise 343. Show that 


dSccfi 

~dt~ 



(16.46) 


(16.47) 


(16.48) 


(16.49) 


(16.50) 


(16.51) 


(16.52) 


(16.53) 
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Exercise 344. Show that 


de afi 

~dt~ 


= s aft (v y V y - CB y \. 


(16.54) 


The two preceding exercises enable us to consider the Levi-Civita symbols e a p 
and e“^. By definition (16.27), 


^ &ufl 


de, 




dt 


- V r V y s a p - f“e^ - tpe ai j. 


(16.55) 


The first term is found in equation (16.53), the second term vanishes, and the 
remaining terms are expanded according to the definition (16.26) of IT : 


Ve^ = (V y vr - CB^j e a p - (V a F“ - CB“) e w p - (v p V a - CBf ) W 

(16.56) 

To show that the right-hand side vanishes, consider V e 1 2 : 

Ve 12 = (v K E>'-CB;) £l2 -(V 1 F“-C5r)ea ) 2-(V 2 F' u -CB 2 “)e l£ „. (16.57) 

Keep the nonzero terms on the right-hand side 

Vs 12 = ( 'v y v r - CB y - VjF 1 + CB\ - V 2 F 2 + CB^j e 12 . (16.58) 

It is clear that this sum vanishes which shows that Ve 12 = 0. The other four entries 
of the tensor V e a a can be analyzed in a similar fashion and we conclude that 

Ve a/j = 0. (16.59) 

The metrinilic property of V with respect to the contravariant Levi-Civita symbol 
e a P is left as an exercise. 

Exercise 345. Show that 


Ve“^ = 0. 


(16.60) 


16.7 The Formula for VN 


From equation (16.35) it is easy to obtain the formula for VN. Dot both sides of 
(16.35) with N : 


N • VS a = V a C. 


(16.61) 
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By the product rule, 

— VN • S„ = V„C. (16.62) 

Therefore 

VN = -S“V„C. (16.63) 

Equation (16.63) is called the Thomas formula for Tracy Thomas who was the first 
to derive it. In component form, the Thomas formula reads 

V/V' = -Z^V“C. (16.64) 

16.8 The Formula for VZ?“ 

p 

Start with equation 

Bp = -S“ • V^N, (16.65) 

which is the vector form of equation (12.41). Apply V to both sides: 

VBp = —VS" • V^N — S“ • VV^N. (16.66) 

By equation (16.31), we find 

<7B a p =-VS“-V /i N-S“-V /j VN-S“-CB^V ffl N. (16.67) 

Since the first and the third terms cancel, the formula for V B'i reads 

p 

VB a p = V"V /J C + CB“B Y . (16.68) 

This formula is found in [16]. Contracting the free indices yields the previously 
mentioned equation (15.49) 

V5“ = V«V“C + CB a pB%. (15.49) 

Exercise 346. Show that the formula for the derivative of the Gaussian curvature 
K reads 


V/f = B^V^C -B^V^C + CKB a a . (16.69) 

In the next chapter, this formula is used to prove a version of the Gauss-Bonnet 
theorem. 


16.9 Summary 
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16.9 Summary 


In this chapter, we extended the invariant time derivative V to variants arbitrary 
indicial signature. The general definition of V is given in equation (16.27). In 
summary, we give the differentiation table of the calculus of moving surfaces: 

VZ i , VZ' = 0 

(16.70) 

VZy, VZ' 7 , V<5' = 0 

(16.71) 

<• 

00 

?r 

<• 

co_ 

II 

o 

(16.72) 

VS a = N V a C 

(16.73) 

VN= -S a V a C 

(16.74) 

VZ' = N'V a C 

(16.75) 

VS a/) , VS afi , V&l = 0 

(16.76) 

Ve«/j, Ve“^ = 0 

(16.77) 

VB% = V a V p C + CBIB P 

(16.78) 


Chapter 17 

Applications of the Calculus of Moving Surfaces 


17.1 Preview 

The applications of the calculus of moving surfaces are remarkably broad. Of 
course, many of the applications come from problems in physics and engineering 
in which physical surfaces move. On the other hand, numerous applications come 
from problems in which, at least in the statement of the problem, there are no moving 
surfaces. There are at least three categories of such problems: shape optimization, 
boundary perturbation, and a third category illustrated by the proof of a version of 
the Gauss-Bonnet theorem. 

Shape optimization is a topic in the calculus of variations, in which a certain 
quantity is minimized with respect to a function. In shape optimization, the unknown 
function is the shape of a domain. The central idea in the calculus of variations is 
to reduce what is essentially an infinite-dimensional optimization problem to a one- 
dimensional problem by introducing a smooth family of functions, parameterized 
by a single variable t, and then searching for the best function in the given family. 
When this idea is carried over to shape optimization, a smooth family of functions 
becomes a smooth family of shapes. In other words, we are dealing with a moving 
surface ! 

A classical shape optimization problem in finding a surface of minimal surface 
area that has a given contour boundary. Such surfaces are called minimal. We show 
that minimal surfaces are characterized by zero mean curvature B" . We also derive 
the geodesic equation. 

Boundary perturbation problems often arise in the study of partial differential 
equations. Suppose that a boundary value problem has been solved on some domain 
Q and let \jt denote the solution. It is often necessary to solve a similar problem on 
another domain G*, the solution being \ jr* . An approximation to \j/* can be found 
by constructing an evolution of domains Q, (now we have a moving surface!) that 
starts as G at time zero and arrives at G* at a later time t* . Let f, be solution 
to the boundary value problem on fi,. Then at f = 0, \[r , coincides with fr and 
at t = t* , x// t coincides with \fr* . The calculus of moving surfaces provides tools 
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for expressing the derivatives of i fr t with respect to time. The derivatives of i fr t can 
typically be evaluated at time t = 0, thereby enabling us to form a Taylor series 
approximation for \fr* . 

In the last category of problems, moving surfaces are introduced as an artificial 
device. Consider the statement of the celebrated Gauss-Bonnet theorem. In its 
simplest form, it states that for a closed smooth surface S of genus zero (that is, 
without bagel-like holes), the integral of Gaussian curvature K is independent of 
the shape and equals 4 jt: 


KdS = 4jt. 


(17.1) 


s 


The integral of K is called the total curvature of the surface. It is easy to verify that 
the Gauss-Bonnet theorem holds for a unit sphere. For an arbitrary smooth surface 
S of genus zero, we can construct a smooth evolution from the unit sphere to S 
and show that the rate of change in the total curvature vanishes. Therefore, the total 
curvature of S is 47T as well. What a nice and unexpected application of the calculus 
of moving surfaces ! 


17.2 Shape Optimization 
1 7.2.1 The Minimal Surface Equation 

A classical problem in shape optimization is to find a surface S of least possible 
area that spans a prescribed contour boundary T in three dimensions. This problem 
has a long and rich history. A discussion of this problem is included in almost every 
text on the calculus of variations. The reader might recall a lengthy derivation of the 
governing equation based on the Euler-Lagrange equation. 

In shape optimization problems such as this one, the calculus of moving surfaces 
shines at its brightest (perhaps by contrast with the alternative methods). With the 
help of the calculus of moving surfaces, the governing equation is derived in a 
single line. Consider a one-parameter family of surfaces S (f), each spanned by 
a stationary contour T. In the following discussion, we omit the reference to the 
parameter t and denote the family of surfaces simply by S . The area A of S is given 
by the integral 



(17.2) 


Differentiating with respect to t, we find by equation (15.56) 



(17.3) 
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Since, for an optimal surface, dA/ dt must vanish for all independent variations and 
since C can be chosen arbitrarily, the equilibrium equation reads 

B“ = 0. (17.4) 

In words, a minimum surface has zero mean curvature. 

Two aspects of this derivation are noteworthy. First is its remarkable compact- 
ness, which shows that the calculus of moving surfaces is ideally suited to this type 
of problems. Second is that the derivation and therefore equation (17.4) are valid for 
hypersurfaces in a Euclidean space of arbitrary dimension. For example, in a two- 
dimensional Euclidean space, surface area corresponds to contour length. Therefore, 
not surprisingly, equation (17.4) tells us that a minimal surface is a straight line. 


17.2.2 The Isoperimetric Problem 

The classical isoperimetric problem is to find a surface of least surface area that 
encloses a given volume V. The constraint is accounted for by introducing a 
Lagrange multiplier A: 


E = J dS+X^J dQ.-v\. (17.5) 

The derivative of E reads 

d ^ = L c + A) ds ' (l7 ' 6) 

Therefore, the equilibrium equation reads 

B“ = A. (17.7) 

In words, a minimal surface that incloses a given volume has constant mean 
curvature. 

It is easy to see that a sphere of radius R, such that 4ttR ‘ ! /3 = V, satisfies this 
condition and the constraint, since for a sphere in three dimensions, B “ = —2/R, 
the equilibrium configuration has 

A = — — . (17.8) 

R 

The equilibrium equations (17.4) and (17.7) constitute necessary, but not suffi- 
cient conditions for a minimum. A shape satisfying equation (17.4) or (17.7) may 
yield a maximum or a stationary point. The sign of the second derivative d 2 E/ dt 2 
can help distinguish between local minima, maxima, and stationary points. 
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1 7.2.3 The Second Variation Analysis for the Isoperimetric 
Problem 

Apply equation (15.54) to (17.6) 

^ = f s V (C (-BZ + A)) dS - ^ C 2 B l (~B“ + A) dS. (17.9) 

By the product rule, the first integral becomes 

f VC (-5“ + A) JS+ ^ C V (-fi“+A) r/S - f C 2 B% (-5“ + A) tW. 

(17.10) 

We are interested in evaluating d 1 E/dt 1 for the equilibrium configuration that 
satisfies equation (17.7). Therefore, the first and the third integrals in (17.10) vanish: 

^ = f s CV(-B« + X)dS. (17.11) 

By equation (15.49), 

^ = ^(-CV„V“C -C 2 BpfydS. (17.12) 

By Gauss’s theorem, 

^ = ^(V„CV"C -C 2 B%B^dS. (17.13) 

The first term in equation (17. 13) is positive (or zero) and the second is negative for 
any nontrivial C . 

Equation (17.12) is best for determining the sign of d 2 E/dt 2 for a general 
variation C . Decompose C as a series in spherical harmonics Yj m 

c = Ci "' Y /*> ( 17 - 14 > 

l> 0, \m\<l 


where / = 0 is disallowed because it violates the volume constraint. Spherical 
harmonics Y/ m are eigenvalues of the surface Laplacian 

/(/ + 1 ) 

R 2 


V a V“T; m 


(17.15) 
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and form an orthonormal set on the unit sphere So 


I Yi\m\ YizrmdS 
JSo 


For a sphere of radius R , we have 


1 , if / 1 = I 2 and m 1 = m 2 
0, otherwise 


(17.16) 


Vi = jp- 

Substituting these relationships into equation (17.12), we find 

d 2 E \ , 

= 4 tt £ Cf m (I — 1 ) (/ + 2 ) . 

l>0,m 


dt 2 


(17.17) 


(17.18) 


The spherical harmonic Too (0. 4>) = (4jt) - 1 / 2 , which corresponds to / = 0, is 
excluded because f Y mt dS > 0 and it violates the constant area constraint. From 
this expression, we conclude that the sphere is neutrally stable (d 2 E /dt 2 = 0) with 
respect to translation as a rigid body (/ = 1) and is stable ( d 2 E/dt 2 > 0) with 
respect to all other smooth deformations. 


17.2.4 The Geodesic Equation 

Consider two points A and B on a two-dimensional surface S characterized by the 
metric tensor S a p . A geodesic is the shortest surface curve that connects A and B. 
Let the geodesic be given parametrically by 

S a = S a {U). (17.19) 

A common derivation [31] of the equation characterizing geodesics is based on 
formulating the Euler-Lagrange equations for the arithmetic integral that yields the 
length of the curve 


[P I dS a dSP , 

L = L <1720 ’ 

We outline a different approach based on minimal surfaces. After all, geodesic 
is a minimal hypersurface and must therefore be characterized by an equation 
analogous to (17.4). The difficulty lies in the fact that equation (17.4) is valid 
for minimal surfaces embedded in Euclidean spaces. Meanwhile, the surface S 
is not a Euclidean space. Nevertheless, we state (without proof) that the general 
definition (16.27) of the invariant time derivative V remains valid in Riemann 
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spaces. Furthermore, the law (15.56) governing the time evolution of surface 
integrals remains valid as well. Therefore, the derivation of Sect. 17.2. 1 continues to 
apply to minimal surfaces in Riemann spaces. We reach the conclusion that minimal 
surfaces in Riemann spaces are characterized by zero mean curvature. When 
applied to one-dimensional curves embedded in two-dimensional surfaces, the term 
mean curvature becomes geodesic curvature. This was discussed in Sect. 13.9. 
Thus, geodesics are characterized by zero geodesic curvature: 

b% = 0. (17.21) 

Equation (17.21) is known as the geodesic equation or equation of a geodesic. 

Let us translate equation (17.21) into an explicit equation for 5“ ( U ). Multiply 
both sides of equation (17.21) by n a 


b%n a = 0 


and recall that b%n a is given by 

b%n a = U* 9 V*S%. 

Therefore, equation (17.21) is equivalent to 

tf**V*S£ = 0. 

Expanding the covariant derivative V$, we find 

( ^^4 | c y 1 n 

u l ~dU® + * ~~ 1 * _ U ' 

In terms of S 01 (U), this equation reads 


( d 2 S a Q dS° \ 

l dU^dU' 5 ' ^dU^dU* ^ du n ) 


= 0. 


(17.22) 

(17.23) 

(17.24) 

(17.25) 

(17.26) 


Equation (17.26) characterizes minimal hypersurfaces of Riemann spaces of arbi- 
trary dimension. Note that ct is a free index and therefore there is a single equation 
for each 5“ (U ). In order to adapt this equation to the case of one-dimensional 
curves, denote by T the single entry of the Christoffel tensor F^.j, on the curve and 
cancel U u : 


d 2 S a ra dS? dS y - dS a 
dU 2 + ftY ~dU ~dU ~dU 


(17.27) 


Finally, if the curve is referred to arc length s, T vanishes and equation (17.27) reads 

d 2 S a dS? dS * _ 

ds 2 ds ds 


(17.28) 
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Equation (17.28) gives the form in which the geodesic equation usually appears. 
Letting 


'J'OL 


dS a 

ds 


(17.29) 


denote the unit tangent, equation (17.28) may be written in the form 

8T a 


(17.30) 


In words, the unit tangent forms a parallel vector field (in the sense of Sects. 8.9 
and 12.7) along a geodesic referred to the arc length .v. 


17.3 Evolution of Boundary Conditions in Boundary 
Value Problems 

A boundary value problem is solved on a domain fi with boundary S . The unknown 
function u is determined by a partial differential equation inside Q and a boundary 
condition prescribed on the boundary S. The classical boundary value problem is 
the Poisson equation 


V, V'h = /, 


(17.31) 


where / is a known function on £2 . 

There are three fundamental types of boundary conditions. Dirichlet boundary 
conditions prescribe the value of the function u along the boundary S . Our focus is 
the zero Dirichlet boundary conditions 

u\ s = 0. (17.32) 

Neumann boundary conditions prescribe the value of the normal derivative along S . 
Our focus is once again the zero Neumann condition 

A'V,m = 0. (17.33) 

Finally, mixed boundary conditions prescribe the value of a linear combination Au+ 
BN'NjU of the function u and its normal derivative N‘ V, u. The quantities A and 
B may be variable functions on the boundary. We focus on the case where A is a 
constant, B = 1 and the prescribed value is zero 


Au + N l V/ u = 0. 


(17.34) 
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Suppose that the domain £2 evolves with time t and that u (t, Z ) is the solution to 
a particular boundary value problem subject to one of the three fundamental types of 
boundary conditions. Our goal is to derive the boundary value problem that governs 
the rate of change 


ll\ = 


du ( t,Z ) 

Ft 


(17.35) 


of u(t,Z). Regardless of the differential relationship that u{t,Z) satisfies in the 
interior of £2, the boundary conditions for u\ can be determined from the boundary 
conditions for u. In each case, the boundary condition is derived by applying the V 
operator to the boundary condition for u. 

For Dirichlet boundary conditions (17.32), an application of V yields 


Vw = 0. 


(17.36) 


Expanding, we find 


du 

Ft 


+ CN'S/.u = 0, 


(17.37) 


or 


;/i = —CN' WjU. 


(17.38) 


Thus, we conclude that Dirichlet boundary conditions for u result in Dirichlet 
boundary conditions for u\. 

For Neumann boundary conditions (17.33), we find 

V (7V' V, m) = 0. (17.39) 

By the product rule and the Thomas rule (16.64), we find 

- Z;V“CV,h+ /V'VV,m = 0. (17.40) 

Apply the chain rule (15.53) to the expression VV, u: 

0 

VV.-M = — V,M + CN j VjS7 iU . (17.41) 

dt 

Since the covariant derivative V, commutes with the partial derivative 9/ dt, we have 


VV,« = V/u i + CN J V/ VjU. 


(17.42) 


We thus arrive at the following condition for u i 


N'WjUx = Z' V“ C V,- u-CN i N j V,- V y u. 


(17.43) 
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or, equivalently, 


jV'VfMi = V"CV„« - CN i N i W i V j u. (17.44) 

Thus, similarly to the Dirichlet case, Neumann boundary conditions for u lead to 
Neumann boundary conditions for it\ . 

Finally, we turn to mixed boundary conditions (17.34) for constant A and B. 
Applying the V operator to equation (17.34), we find 

AVm + BV(A'V,m) = 0. (17.45) 

Combining the analyses of the Dirichlet and Neumann cases, we find 

Aui + BN'VjU\ = —ACN'VjU + BV a CV a u - ( />’ V .V ' V; V.h. (17.46) 

Thus, mixed conditions for u lead to mixed conditions for u\ . 


17.4 Eigenvalue Evolution and the Hadamard Formula 

In this section, we study the dependence of Laplace eigenvalues on shape. More 
specifically, we analyze the rate of change in the eigenvalue induced by smooth 
deformations of the domain. The eigenvalues A are determined by the Laplace 
eigenvalue equation 


V,V'm = —Am (17.47) 

in combination with Dirichlet (17.32), Neumann (17.33), or mixed (17.34) boundary 
condition on the boundary S of the domain 12 on which the interior condition 
(17.47) is to be satisfied. The interior equation and the boundary condition, define 
the eigenfunction u to within a multiplicative constant. The usual way to remove 
this arbitrariness is to normalize the eigenfunction to unity according to equation 

I u 2 dO, = 1. (17.48) 

J n 

With the normalization condition in place, the eigenfunction u is determined to 
within a sign. 

When the domain £2, with its boundary S, evolve as a function of time t, so 
do the Laplace eigenvalues A. If we denote the evolution of the domain by £2 (?) 
and S (f) and the evolution of the eigenvalues by A (r), our goal is to determine the 
expression for the time derivative Ai of A (/): 

. dX (t) 

X\ — 


dt 


(17.49) 
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The eigenvalue A corresponding to the eigenfunction u is given by the Rayleigh 
quotient 


f a WudSl 

fs i u ~dQ. 


(17.50) 


Since we have introduced the normalization condition (17.48), the Rayleigh quotient 
reduces to its numerator 


A = f VjiiVucm. (17.51) 

Jn 

The Rayleigh quotient is the starting point of our analysis. However, let us save 
this analysis for last and first determine the relationships that the interior equation 
(17.47) and the normalization condition (17.48) imply about the time derivative u\ 
of the eigenfunction u: 


u\ (t, Z) = 


du (t, Z) 
ft 


The function u\ is called the first variation of the eigenfunction u. 
Differentiate the interior equation (17.47) with respect to time 


(17.52) 


d d 

- V, V' u (t, Z) = - - (A (0 u 0 1 , Z» . (17.53) 

It is left as an exercise to show that the covariant and contravariant derivatives V, 
and V' commute with partial time differentiation. The right-hand side is expanded 
by the product rule. The result is 


V, V'«i = — Aim — A«i. (17.54) 

Transferring the term — Xu\ to the left-hand side, we obtain the final equation for u\ 

ViV'Mj + Am] = —A \u. (17.55) 


Therefore, the first variation of the eigenfunction satisfies an inhomogeneous 
Helmholtz equation. 

The normalization condition (17.48) also gives us information regarding u\ . 
Differentiate equation (17.48) with respect to time t: 


d_ 

dt 



= 0 . 


(17.56) 
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By equation (15.54), we find 


/ 2uu\dQ. + 

Jn Js 


2uu\dQ, + / CifdS = 0. 

Is 


(17.57) 


Under Dirichlet conditions (17.32), the surface integral vanishes identically in 
time. Therefore, under Dirichlet conditions, the first eigenfunction variation ii\ is 
orthogonal to the eigenfunction u in the sense 


/ uu\dQ. = 0. 
Jn 


(17.58) 


We are now in a position to derive the Hadamard formula for A i . Let us begin 
with Dirichlet conditions. As we started before, our starting point is the Rayleigh 
quotient (17.51). Applying the ordinary time derivative, we have 


d C 

A i = — — / V,nV'MG?f2. 

dt Jn 


(17.59) 


By the volume integral law (15.54), we have 

3 (V, mV' n) 


Ai = [ ' ' 'dQ- / CS/juVudS. 

Jn vt 


L 


(17.60) 


By a combination of the product rule and the fact that partial differentiation 
commutes with covariant differentiation, we find 


Ai — / 2 V,- u VLq d -f- 

Jn 


J CWiuVudS. (17.61) 

Use Gauss’s theorem to transfer the covariant derivative V' from u to V, u t : 


Ai 


= 2 [ u NjVindS -2 I 

Js Jn 


L 


mV,- V' ni<7S2 + / CVjuVudS. 


(17.62) 


By the Dirichlet condition (17.32) the first surface integral vanishes. Furthermore, 
equation (17.54) gives us an expression for V, V';/ 1 . Thus 


A ! — 2 / (A \u^ -\- A uit\} d £ 2 -)- / 
Jn Js 

Split the first integral into two: 


C V.mV' udS. 


Ai = 2Ai / u 2 d£l + 2A / uu\dQ. + / CV,hV'h<75. 

Jn Jn Js 


(17.63) 


(17.64) 
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The first term equals 2X\ by the normalization condition (17.48). The second term 
vanishes by equation (17.58). We have therefore arrived at the following expression 
for Ai 


Aj = - f CV,-wVWS. (17.65) 

Js 

This result is known as the Hadamard formula. 

Exercise 347. Show that, under sufficient conditions of smoothness, the partial 
time derivative d/dt commutes with the covariant derivative V, n. 

Exercise 348. The Laplace eigenfunctions corresponding to the simple eigenvalues 
on the unit circle under Dirichlet boundary conditions are given by 


u (r,9) = 


Jo(pr) 

V*Ji (p)' 


(17.66) 


where J„ are Bessel functions of the first kind and p is a root of Jq. Note that the 
Bessel functions Jo and J\ are connected by the relationship 


Jo (x) = —J i (x) . (17.67) 

Consider the following evolution from the unit circle to an ellipse with semi-axes 
1 + a and 1 + b: 


( x(t,a) = (1 +at)cosa (17 68) 

( y ( t,a ) = (1 + bt) sin a 

Show that at t = 0, according to the Hadamard formula, we have 

Xi=~(a + b)X. (17.69) 

Exercise 349. Show that equation (17.69) can be derived without the calculus of 
moving surfaces by appealing to Euler’s homogeneous function theorem. (Hint: 
Laplace eigenvalues are homogeneous functions of the ellipse semi-axis lengths). 

We now derive the expression for u\ under Neumann conditions. We once again 
start with the Rayleigh quotient and, by applying d/dt , arrive at the following 
equation 


Ai 


/ 

Jn 


2 VjuV l UidQ + / CVjmV z udS. 


L 


(17.70) 


This time, we use Gauss’s theorem to transfer the covariant derivative V, from iq to 
V'm: 
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Ai = 2 f uiNiYudS^ 2 f miV,VWQ + / CV,mVWS. (17.71) 
Js Jn Js 

The first integral vanishes by the Neumann condition (17.33). In the second integral, 
substitute —Xu for V, V';/ by equation (17.47): 

X\ = 2X I uu\d£l + I C^iuS/'udS. (17.72) 

Jn Js 

Since the first integral is given by equation (17.57), we arrive at the final expression 

Aj = / C (V,mV'm - Xu 2 ) dS. (17.73) 

Js 

This equation is known as the Hadamard formula for Neumann boundary 
conditions. 

Derivation of the Hadamard formula under mixed conditions is left as an 
exercise. 

Exercise 350. Show that, under all boundary conditions among (17.32), (17.33), 
and (17.34), 


Ai 



Nj V'mi) dS. 


(17.74) 


Exercise 351. Use the divergence theorem to convert equation (17.74) to the 
following form 


X x = \ (mV, V' —u\ V, V' u)dQ,. (17.75) 

Jn 

Exercise 352. Derive the Hadamard formula for mixed boundary conditions 

X f =[ C (— V, mV' m + 2Vq,mV“m - Xu 1 + B*u N, V' m) dS (17.76) 

Js 

Exercise 353. Show that the Hadamard formula (17.76) reduces to equation 
(17.65) under Dirichlet conditions and equation (17.73) under Neumann conditions. 


17.5 A Proof of the Gauss-Bonnet Theorem 

Our proof of a special case of the Gauss-Bonnet theorem is a particularly neat 
application of the calculus of moving surfaces. After all, the Gauss-Bonnet theorem 
is a result in classical differential geometry on stationary manifolds. There are no 


292 


17 Applications of the Calculus of Moving Surfaces 


moving surfaces in sight! Nevertheless, we reformulate the Gauss-Bonnet theorem 
as a problem in moving surfaces. The presented proof is excellent as a final 
demonstration in this book as it draws on many of the fundamental relationships 
in tensor calculus and the calculus of moving surfaces. 

The Gauss-Bonnet theorem states that the integral of Gaussian curvature over a 
closed surface is independent of its shape and only depends on its genus. The genus 
( genera in plural) of a surface is a topological concept. In plain language, the genus 
of a closed surface is the number of holes that it has. Thus, a sphere is a surface 
of genus zero and a torus is a surface of genus one. If g is the genus of the closed 
surface S, the Gauss-Bonnet theorem states that 


l 


KdS = An (1 — g) . 


Thus, for surfaces of genus zero 


(17.77) 


and for surfaces of genus one 


l 

L 


KdS = 47T 


KdS = 0. 


(17.78) 


(17.79) 


Equation (17.77) is a special case of the Gauss-Bonnet theorem. A more general 
Gauss-Bonnet theorem holds for nonclosed surfaces with contour boundaries. 
Furthermore, the Gauss-Bonnet holds for general two-dimensional Riemannian 
manifolds in the intrinsic sense — that is, without a reference to an embedding. 
Finally, in topology, the Gauss-Bonnet theorem is formulated in an even more 
general setting that makes no reference to the Riemannian metric. 

We can prove our special case of the Gauss-Bonnet theorem by showing that the 
integral of Gaussian curvature 


T = [ KdS, 

Js 

known as the total cun’ature, is unchanged under sufficiently smooth evolutions. 
This would show that T is independent of shape, but does not quite prove that T = 
4tt (1 — g). However, for surfaces of genera zero and one, this can be verified by 
calculating T for a sphere and a torus. For surfaces of higher genera, the result can 
be argued (although this is not as simple as it may seem) by induction by annexing 
a torus to a surface of genus g to obtain a surface of genus g + 1 . 

So, our goal is to show that, under a smooth evolution, the total curvature T is 
unchanged: 


17.6 The Dynamic Fluid Film Equations 
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7 . 


— I KdS = 0. 
at 


By the surface integral law (15.56), we have 


— [ KdS = f VKdS - [ 
dt J s Js Js 


CKB a dS. 


Recall from Chap. 16 that VAT is given by the beautiful formula 
S7K = - BpV a S7 p C + CKB a a . 

Therefore, equation (17.81) becomes 


(17.80) 


(17.81) 


(16.69) 


d_ 

dt 


l 


KdS 



- 5£V a V^c) dS. 


(17.82) 


We next apply Gauss’s theorem to each of the terms in the integral on the right- 
hand side. In the first term, we use Gauss’s theorem to transfer from V'” to B“ . 
In the second integral, Gauss’s theorem helps transfer V„ from V^C to B“. Since 
the surface is closed, there arise no contour integrals. As a result, we arrive at the 
following expression 


j s KdS = j s (— V/j VtCdS. 


(17.83) 


We next use the Codazzi identity 


= VpB ay . (12.73) 

derived in Chap. 12. By raising the index y and contracting it with a, we find 

= V P B“. (17.84) 

Therefore, the integrand in (17.83) vanishes and equation (17.80) is confirmed! 


17.6 The Dynamic Fluid Film Equations 

Finally, we come to a dynamic application of the calculus of moving surfaces, and 
one that is perhaps closest to the author’s heart. We discuss the exact equations for 
the dynamics of free fluid films under the influence of surface tension. We consider a 
fluid film that spans a stationary three-dimensional smooth curved contour. The free 
fluid film is modeled as a two-dimensional surface S with a thickness represented 
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by variable surface mass density p. We decompose the fluid film velocity Y in the 
normal component C and the tangential components V a 


V=CN+Y“S a . (17.85) 

The potential energy of the fluid film is captured by Laplace’s model of surface 
tension 

P = a I dS. (17.86) 

Js 

Therefore, the Lagrangian L is given by 


L = - [ p(C 2 + V a V a )dS -a f dS. (17.87) 

2 Js Js 

The dynamic system [17-19] reads: 

Vp + V a (pV a ) = pCB a a (17.88) 
P (VC +2V a V a C + B aP V a V^ =oB a a (17.89) 
VL“ + vNpV 01 -CV a C -CV^Bl =0 (17.90) 


The first equation (17.88) represents conservation of mass. Equations (17.89) and 
(17.90) are the momentum equations for the normal and tangential components. The 
normal velocity C vanishes at the boundary F 


C = 0 


(17.91) 


while the in-surface components V a are tangential to the boundary 

n a V a = 0. (17.92) 

As an illustration, we demonstrate two key properties of these equations: 
conservation of mass and conservation of energy. The total mass M of the fluid 
film is given by the integral 


M 


= J pdS. 

Thus, according to the surface law (15.56), M changes at the rate 

= f S/pdS - [ P CB a a dS. 

Js Js 


(17.93) 


dM 

dt 


(17.94) 
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An application of equation (17.88) yields 


dM 

dt 



(17.95) 


Therefore, by the Gauss theorem, 


dM 

dt 


L 


n a pV a dS 


(17.96) 


This integral vanishes according to the boundary condition (17.92) and conservation 
of mass is therefore proven. 

To prove conservation of energy, apply the surface law (15.56) to the total energy 


The result is 


E = 



+ V a V a )dS + o 


L 


dS. 


(17.97) 


E= f ( sVp (c 2 + v a v a ) + p (cvc + V a VV a ) \ ds 
Js\ 4 pCBl(C 2 +V a V a )-oCB“ J 


(17.98) 


Use equations (17.88)— (17.90) to eliminate Vp, VC and V V" in this equation. 
Upon the cancellation of several terms we arrive at the following equation 


^ = J (pV?) (C 2 + V a V a ) - pCV a V a C - pV^VpV^ dS. 

(17.99) 

An application of Gauss’s theorem cancels the rest of the terms and the resulting 
boundary term vanishes due to the boundary condition (17.92). This completes the 
proof on the conservation of energy. 

Exercise 354. Combine equations (17.89)— ( 17.90) into a single second-order 
vector equation. 


17.7 Summary 

In this final chapter of the book, you experienced the versatility of the calculus 
of moving surfaces. Interestingly, we presented the calculus of moving surfaces 
in a relatively narrow setting: embedded surfaces in a Euclidean space. There 
exists an extension of the calculus of moving surfaces to surfaces of arbitrary 
dimension embedded in deforming Riemannian spaces. Nevertheless, even the 
relatively narrow version of the calculus of moving surfaces presented here finds 
a broad range of applications. 
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